
SEQUENCE LISTING 



<110> Harms, Jerome S 
Splitter, Gary A 
Eakle, Kurt A 
Bremel, Robert D 

<120>' INDUCIBLE PROTEIN EXPRESSION SYSTEM 

<130> 960296.00463 

<140> 10/763,976 
<141> 2004-01-23 

<150> US 60/442,103 
<151> 2003-01-23 

<160> 29 

<170> Patentln version 3.3 

<210> 1 

<211> 480 

<212> DNA 

<213> Artificial 

<220> 

<223> Engineered sequence from virus and plasmid 



<220> 

<221> promoter 

<222> (87).. (432) 

<223> BLV promoter 

<400> 1 

gaaaccagca gcggctatcc gcgcatccat 
atggccgctt tggtcgaggc ggatcctagc 
ttacaactgc tagaaaatga atggctctcc 
aaagatcatg ccgacctagg cgccgccacc 
ccagaaaagc tggtgacggc agctggtggc 
ttcccgaaaa atccacaccc tgagctgctg 
cggccctgtc gagttagcgg caccagaagc 
ctcggtcctg cctcgagaag cttgttatca 



gcccccgaac tgcaggagtg gggaggcacg 
agaaaaataa gacttgattc ccccttaaaa 
cgcctttttt gagggggaat catttgtatg 
gccccgtaaa ccagacagag acgtcagctg 
tagaatcccc gtacctcccc aacttcccct 
acctcacctg ctgataaatt aataaaatgc 
gttcttctcc tgagaccctc gtgctcagct 
caagtttgta caaaaaagct gaacgagaaa 



<210> 2 

<211> 929 

<212> DNA 

<213> Bovine leukemia virus 



1 



<220> 

<221> CDS 

<222> (1)..(927) 

<400> 2 

atg gca agt gtt gtt ggt tgg ggg ccc cac tct eta cat gec tgc ccg 48 
Met Ala Ser Val Val Gly Trp Gly Pro His Ser Leu His Ala Cys Pro 
15 10 15 

gec ctg gtt ttg tec aat gac gtc acc ate gat gec tgg tgc ccc etc 96 
Ala Leu Val Leu Ser Asn Asp Val Thr lie Asp Ala Trp Cys Pro Leu 
20 25 30 

tgc ggg ccc cat gag cga etc caa ttc gaa agg ate gac acc acg cac 144 
Cys Gly Pro His Glu Arg Leu Gin Phe Glu Arg lie Asp Thr Thr His 
35 40 45 

acc tgc gag acc cac cgt ate acc tgg acc gee gat gga cga cct ttc 192 
Thr Cys Glu Thr His Arg lie Thr Trp Thr Ala Asp Gly Arg Pro Phe 
50 55 60 

ggc etc aat gga gcg ctg ttc cct cga ctg cat gtc tec aga gac ccg 240 
Gly Leu Asn Gly Ala Leu Phe Pro Arg Leu His Val Ser Arg Asp Pro 
65 70 75 80 

gee cca agg gee cga cga etc tgg ate aac tgc ccc ctt ccg gee gtt 288 
Ala Pro Arg Ala Arg Arg Leu Trp lie Asn Cys Pro Leu Pro Ala Val 
85 90 95 

cgc get cag ccc ggc ccg gtt tea ctt tec ccc ttc gag egg tec ccc 336 
Arg Ala Gin Pro Gly Pro Val Ser Leu Ser Pro Phe Glu Arg Ser Pro 
100 105 110 

ttc cag ccc tac caa tgc caa ttg ccc teg gec tct age gac ggt tgc 384 
Phe Gin Pro Tyr Gin Cys Gin Leu Pro Ser Ala Ser Ser Asp Gly Cys 
115 120 125 

ccc gtc ate ggg cac ggc ctt ctt ccc tgg aac aac tta gta acg cat 432 
Pro Val lie Gly His Gly Leu Leu Pro Trp Asn Asn Leu Val Thr His 
130 135 140 

cct tgt cct egg aaa gtc ctt ata tta aat caa atg gee aat ttt tec 480 
Pro Cys Pro Arg Lys Val Leu lie Leu Asn Gin Met Ala Asn Phe Ser 
145 150 155 160 

tta etc ccc ccc ttc aat acc etc ctt gtg gac ccc etc egg ttg tec 528 
Leu Leu Pro Pro Phe Asn Thr Leu Leu Val Asp Pro Leu Arg Leu Ser 
165 170 175 

gtc ttt gee cca gac acc agg gga gee ata cgt tat etc tec acc ctt 576 
Val Phe Ala Pro Asp Thr Arg Gly Ala He Arg Tyr Leu Ser Thr Leu 
180 185 190 

ttg acg eta tgc cca get act tgt att eta ccc etc ggc gag ccc ttc 624 
Leu Thr Leu Cys Pro Ala Thr Cys He Leu Pro Leu Gly Glu Pro Phe 
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195 200 205 

tct cct aat gtc ccc ata tgt cgc ttt ccc egg gac tec aat gaa ccc 672 
Ser Pro Asn Val Pro He Cys Arg Phe Pro Arg Asp Ser Asn Glu Pro 
210 215 220 

ccc ctt tea gaa ttc gag ctg ccc ctt ate caa acg ccc ggc ctg tct 720 
Pro Leu Ser Glu Phe Glu Leu Pro Leu He Gin Thr Pro Gly Leu Ser 
225 230 235 240 

tgg tct gtc ccc gcg ate gac eta ttc eta ace ggc ccc cct tec cca 768 
Trp Ser Val Pro Ala He Asp Leu Phe Leu Thr Gly Pro Pro Ser Pro 
245 250 255 

tgc gac egg tta cac gta tgg tec agt cct cag gee tta cag cgc ttc 816 
Cys Asp Arg Leu His Val Trp Ser Ser Pro Gin Ala Leu Gin Arg Phe 
260 265 270 

etc cat gac cct acg eta acc tgg tea gaa ttg gtt get age agg aaa 864 
Leu His Asp Pro Thr Leu Thr Trp Ser Glu Leu Val Ala Ser Arg Lys 
275 280 285 

eta aga ctt gat tea ccc tta aaa tta caa ctg tta gaa aat gaa tgg 912 
Leu Arg Leu Asp Ser Pro Leu Lys Leu Gin Leu Leu Glu Asn Glu Trp 
290 295 300 

etc tec cgc ctt ttt tg 929 

Leu Ser Arg Leu Phe 

305 



<210> 3 
<211> 309 
<212> PRT 

<213> Bovine leukemia virus 
<400> 3 

Met Ala Ser Val Val Gly Trp Gly Pro His Ser Leu His Ala Cys Pro 
15 10 15 



Ala Leu Val Leu Ser Asn Asp Val Thr He Asp Ala Trp Cys Pro Leu 
20 25 30 



Cys Gly Pro His Glu Arg Leu Gin Phe Glu Arg He Asp Thr Thr His 
35 40 45 



Thr Cys Glu Thr His Arg He Thr Trp Thr Ala Asp Gly Arg Pro Phe 
50 55 60 



Gly Leu Asn Gly Ala Leu Phe Pro Arg Leu His Val Ser Arg Asp Pro 
65 70 75 80 
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Ala Pro Arg Ala Arg Arg Leu Trp lie Asn Cys Pro Leu Pro Ala Val 
85 90 95 



Arg Ala Gin Pro Gly Pro Val Ser Leu Ser Pro Phe Glu Arg Ser Pro 
100 105 110 



Phe Gin Pro Tyr Gin Cys Gin Leu Pro Ser Ala Ser Ser Asp Gly Cys 
115 120 125 



Pro Val lie Gly His Gly Leu Leu Pro Trp Asn Asn Leu Val Thr His 
130 135 140 



Pro Cys Pro Arg Lys Val Leu lie Leu Asn Gin Met Ala Asn Phe Ser 
145 150 155 160 



Leu Leu Pro Pro Phe Asn Thr Leu Leu Val Asp Pro Leu Arg Leu Ser 
165 170 175 



Val Phe Ala Pro Asp Thr Arg Gly Ala lie Arg Tyr Leu Ser Thr Leu 
180 185 190 



Leu Thr Leu Cys Pro Ala Thr Cys lie Leu Pro Leu Gly Glu Pro Phe 
195 200 205 



Ser Pro Asn Val Pro lie Cys Arg Phe Pro Arg Asp Ser Asn Glu Pro 
210 215 220 



Pro Leu Ser Glu Phe Glu Leu Pro Leu lie Gin Thr Pro Gly Leu Ser 
225 230 235 240 



Trp Ser Val Pro Ala lie Asp Leu Phe Leu Thr Gly Pro Pro Ser Pro 
245 250 255 



Cys Asp Arg Leu His Val Trp. Ser Ser Pro Gin Ala Leu Gin Arg Phe 
260 265 270 



Leu His Asp Pro Thr Leu Thr Trp Ser Glu Leu Val Ala Ser Arg Lys 
275 280 285 



Leu Arg Leu Asp Ser Pro Leu Lys Leu Gin Leu Leu Glu Asn Glu Trp 
290 295 300 
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Leu Ser Arg Leu Phe 
305 



<210> 4 

<211> 1062 

<212> DNA 

<213> Human T-cell lympho tropic virus type 1 



<220> 

<221> CDS 

<222> (1)..(1062) 

<220> 

<221> promoter 
<222> (1)..(353) 

<400> 4 

atg gcc cac tte cca ggg ttt gga cag agt ctt ctt tte gga tae cca 48 

Met Ala His Phe Pro Gly Phe Gly Gin Ser Leu Leu Phe Gly Tyr Pro 
15 10 15 

gtc tae gtg ttt gga gac ggc gac tgg tgc ccc tgt gta caa ate tct 96 
Val Tyr Val Phe Gly Asp Gly Asp Trp Cys Pro Cys Val Gin lie Ser 
20 25 30 

ggg gga eta tgt teg gcc cgc eta cat cgt cac gcc eta ctg gcc acc 144 
Gly Gly Leu Cys Ser Ala Arg Leu His Arg His Ala Leu Leu Ala Thr 
35 40 45 

tgt cca gag cat cag ate acc tgg gac ccc ate gat gga cgc gtt ate 192 
Cys Pro Glu His Gin lie Thr Trp Asp Pro lie Asp Gly Arg Val lie 
50 55 60 

ggc tea get eta cag tte ctt ate cct cga etc ccc tec tte ccc acc 240 
Gly Ser Ala Leu Gin Phe Leu He Pro Arg Leu Pro Ser Phe Pro Thr 
65 70 75 80 

cag aga acc tct aag acc etc aag gtc ctt acc ccg cca ate act cat 2 88 

Gin Arg Thr Ser Lys Thr Leu Lys Val Leu Thr Pro Pro He Thr His 
85 90 95 

aca acc ccc aac att cca ccc tec tte etc cag gcc atg cgc aaa tac 336 
Thr Thr Pro Asn He Pro Pro Ser Phe Leu Gin Ala Met Arg Lys Tyr 
100 105 110 

tec ccc tte cga aat gga tac atg gaa ccc acc ctt ggg cag cac etc 384 
Ser Pro Phe Arg Asn Gly Tyr Met Glu Pro Thr Leu Gly Gin His Leu 
115 120 125 

cca acc ctg tct ttt cca gac ccc gga etc egg ccc caa aac ctg tac 432 
Pro Thr Leu Ser Phe Pro Asp Pro Gly Leu Arg Pro Gin Asn Leu Tyr 
130 135 140 

acc etc tgg gga ggc tec gtt gtc tgc atg tac etc tac cag ctt tec 480 
Thr Leu Trp Gly Gly Ser Val Val Cys Met Tyr Leu Tyr Gin Leu Ser 
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145 150 155 160 

ccc ccc ate acc tgg ccc etc ctg ccc cac gtg att ttt tgc cac ccc 528 
Pro Pro lie Thr Trp Pro Leu Leu Pro His Val lie Phe Cys His Pro 
165 170 175 

ggc cag etc ggg gec ttc etc acc aat gtt ccc tac aag cga ata gaa 576 
Gly Gin Leu Gly Ala Phe Leu Thr Asn Val Pro Tyr Lys Arg lie Glu 
180 185 190 

gaa etc etc tat aaa att tec ctt acc aca ggg gee eta ata att eta 624 
Glu Leu Leu Tyr Lys lie Ser Leu Thr Thr Gly Ala Leu lie lie Leu 
195 200 205 

ccc gaa gac tgt ttg ccc acc acc ctt ttc cag cct gtt agg gca ccc 672 
Pro Glu Asp Cys Leu Pro Thr Thr Leu Phe Gin Pro Val Arg Ala Pro 
210 215 220 

gtc acg eta aca gee tgg caa aac ggc etc ctt ccg ttc cac tea acc 720 
Val Thr Leu Thr Ala Trp Gin Asn Gly Leu Leu Pro Phe His Ser Thr 
225 230 235 240 

etc acc act cca ggc ctt att tgg aca ttt acc gat ggc acg cct atg 768 
Leu Thr Thr Pro Gly Leu lie Trp Thr Phe Thr Asp Gly Thr Pro Met 
245 250 255 

att tec ggg ccc tgc cct aaa gat ggc cag cca tct tta gta eta cag 816 
lie Ser Gly Pro Cys Pro Lys Asp Gly Gin Pro Ser Leu Val Leu Gin 
260 265 270 

tec tec tec ttt ata ttt cac aaa ttt caa acc aag gee tac cac ccc 864 
Ser Ser Ser Phe lie Phe His Lys Phe Gin Thr Lys Ala Tyr His Pro 
275 280 285 

tea ttt eta etc tea cac ggc etc ata cag tac tct tec ttt cat aat 912 
Ser Phe Leu Leu Ser His Gly Leu lie Gin Tyr Ser Ser Phe His Asn 
290 295 300 

tta cat etc ctg ttt gaa gaa tac acc aac ate ccc att tct eta ctt 960 
Leu His Leu Leu Phe Glu Glu Tyr Thr Asn lie Pro lie Ser Leu Leu 
305 310 315 320 

ttt aac aaa aaa gag gca gat gac aat gac cat gag ccc caa ata tec 1008 
Phe Asn Lys Lys Glu Ala Asp Asp Asn Asp His Glu Pro Gin lie Ser 
325 330 335 

ccc ggg ggc tta gag cct ccc agt gaa aaa cat ttc cgc gaa aca gaa 1056 
Pro Gly Gly Leu Glu Pro Pro Ser Glu Lys His Phe Arg Glu Thr Glu 
340 345 350 

gtc tga 1062 
Val 



<210> 5 
<211> 353 
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<212> PRT 

<213> Human T-cell lymphotropic virus type 1 



<400> 5 

Met Ala His Phe Pro Gly Phe Gly Gin Ser Leu Leu Phe Gly Tyr Pro 
15 10 15 



Val Tyr Val Phe Gly Asp Gly Asp Trp Cys Pro Cys Val Gin lie Ser 
20 25 30 



Gly Gly Leu Cys Ser Ala Arg Leu His Arg His Ala Leu Leu Ala Thr 
35 40 45 



Cys Pro Glu His Gin lie Thr Trp Asp Pro lie Asp Gly Arg Val lie 
50 55 60 



Gly Ser Ala Leu Gin Phe Leu lie Pro Arg Leu Pro Ser Phe Pro Thr 
65 70 75 80 



Gin Arg Thr Ser Lys Thr Leu Lys Val Leu Thr Pro Pro lie Thr His 
85 90 95 



Thr Thr Pro Asn lie Pro Pro Ser Phe Leu Gin Ala Met Arg Lys Tyr 
100 105 110 



Ser Pro Phe Arg Asn Gly Tyr Met Glu Pro Thr Leu Gly Gin His Leu 
115 120 125 



Pro Thr Leu Ser Phe Pro Asp Pro Gly Leu Arg Pro Gin Asn Leu Tyr 
130 135 140 



Thr Leu Trp Gly Gly Ser Val Val Cys Met Tyr Leu Tyr Gin Leu Ser 
145 150 155 160 



Pro Pro lie Thr Trp Pro Leu Leu Pro His Val lie Phe Cys His Pro 
165 170 175 



Gly Gin Leu Gly Ala Phe Leu Thr Asn Val Pro Tyr Lys Arg He Glu 
180 185 190 



Glu Leu Leu Tyr Lys He Ser Leu Thr Thr Gly Ala Leu lie He Leu 
195 200 205 
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Pro Glu Asp Cys Leu Pro Thr Thr Leu Phe Gin Pro Val Arg Ala Pro 
210 215 220 



Val Thr Leu Thr Ala Trp Gin Asn Gly Leu Leu Pro Phe His Ser Thr 
225 230 235 240 



Leu Thr Thr Pro Gly Leu lie Trp Thr Phe Thr Asp Gly Thr Pro Met 
245 250 255 



lie Ser Gly Pro Cys Pro Lys Asp Gly Gin Pro Ser Leu Val Leu Gin 
260 265 270 



Ser Ser Ser Phe lie Phe His Lys Phe Gin Thr Lys Ala Tyr His Pro 
275 280 285 



Ser Phe Leu Leu Ser His Gly Leu lie Gin Tyr Ser Ser Phe His Asn 
290 295 300 



Leu His Leu Leu Phe Glu Glu Tyr Thr Asn lie Pro lie Ser Leu Leu 
305 310 315 320 



Phe Asn Lys Lys Glu Ala Asp Asp Asn Asp His Glu Pro Gin lie Ser 
325 330 335 



Pro Gly Gly Leu Glu Pro Pro Ser Glu Lys His Phe Arg Glu Thr Glu 
340 345 350 



Val 



<210> 6 

<211> 353 

<212> DNA 

<213> Human T-cell lymphotropic virus type 1 

<400> 6 



tgacaatgac 


catgagcccc 


aaatatcccc 


cgggggctta 


gagcctctca 


gtgaaaaaca 


60 


tttccgtgaa 


acagaagtct 


gagaaggtca 


gggcccagaa 


taaggctctg 


acgtctcccc 


120 


ccggaggaca 


gctcagcacc 


agctcaggct 


aggccctgac 


gtgtccccct 


aaagacaaat 


180 


cataagctca 


gacctccggg 


aagccaccgg 


gaaccaccca 


tttcctcccc 


atgtttgtca 


240 


agccgtcctc 


aggcgttgac 


gacaacccct 


cacctcaaaa 


aacttttcat 


ggcacgcata 


300 


cggctcaata 


aaataacagg 


agtctataaa 


agcgtgggga 


cagttcagga 


ggg 


353 



<210> 7 
<211> 456 
<212> DNA 

<213> Human immunodeficiency virus type 1 
<400> 7 

ctggaagggc taatttggtc ccaaagaaga caagagatcc ttgatctgtg gatctaccac 60 
acacaaggct acttccctga ttggcagaat tacacaccag ggccagggat cagatatcca 120 
ctgacctttg gatggtgctt caagctagta ccagttgagc cagagaaggt agaagaggcc 180 
aatgaaggag agaacaacag cttgttacac cctatgagcc tgcatgggat ggaggacgcg 240 
gagaaagaag tgttagtgtg gaggtttgac agcaaactag catttcatca catggcccga 300 
gagctgcatc cggagtacta caaagactgc tgacatcgag ctttctacaa gggactttcc 360 
gctggggact ttccagggag gcgtggcctg ggcgggactg gggagtggcg tccctcagat 420 
gctgcatata agcagctgct ttttgcctgt actggg 456 



<210> 8 

<211> 306 

<212> DNA 

<213> Human immunodeficiency virus type 1 



<220> 

<221> CDS 

<222> (1) . . (303) 

<400> 8 

atg gag cca gta gat cct aat eta gag ccc tgg aag cat cca gga agt 48 

Met Glu Pro Val Asp Pro Asn Leu Glu Pro Trp Lys His Pro Gly Ser 
15 10 15 

cag cct agg act get tgt aac aat tgc tat tgt aaa aag tgt tgc ttt 96 
Gin Pro Arg Thr Ala Cys Asn Asn Cys Tyr Cys Lys Lys Cys Cys Phe 
20 25 30 

cat tgc tac gcg tgt ttc aca aga aaa ggc tta ggc ate tec tat ggc 144 
His Cys Tyr Ala Cys Phe Thr Arg Lys Gly Leu Gly lie Ser Tyr Gly 
35 40 45 

agg aag aag egg aga cag cga cga aga get cct cag gac agt cag act 192 
Arg Lys Lys Arg Arg Gin Arg Arg Arg Ala Pro Gin Asp Ser Gin Thr 
50 55 60 

cat caa get tct eta tea aag caa ccc gee tec cag tec cga ggg gac 240 
His Gin Ala Ser Leu Ser Lys Gin Pro Ala Ser Gin Ser Arg Gly Asp 
65 70 75 80 

ccg aca ggc ccg acg gaa teg aag aag aag gtg gag aga gag aca gag 288 
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Pro Thr Gly Pro Thr Glu Ser Lys Lys Lys Val Glu Arg Glu Thr Glu 
85 90 95 

aca gat ccg ttc gat tag 306 
Thr Asp Pro Phe Asp 
100 



<210> 9 
<211> 101 
<212> PRT 

<213> Human immunodeficiency virus type 1 
<400> 9 

Met Glu Pro Val Asp Pro Asn Leu Glu Pro Trp Lys His Pro Gly Ser 
15 10 15 



Gin Pro Arg Thr Ala Cys Asn Asn Cys Tyr Cys Lys Lys Cys Cys Phe 
20 25 30 



His Cys Tyr Ala Cys Phe Thr Arg Lys Gly Leu Gly lie Ser Tyr Gly 
35 40 45 



Arg Lys Lys Arg Arg Gin Arg Arg Arg Ala Pro Gin Asp Ser Gin Thr 
50 55 60 



His Gin Ala Ser Leu Ser Lys Gin Pro Ala Ser Gin Ser Arg Gly Asp 
65 70 75 80 



Pro Thr Gly Pro Thr Glu Ser Lys Lys Lys Val Glu Arg Glu Thr Glu 
85 90 95 



Thr Asp Pro Phe Asp 
100 



<210> 10 

<211> 7685 

<212> DNA 

<213> Artificial 

<220> 

<223> Engineered sequence from virus and plasmid 



<220> 

<221> LTR 

<222> (149) . . (737) 

<223> 5' MoMuSVLTR 
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<220> 

<221> CDS 

<222> (1753) . . (2148) 

<223> Blasticidin resistance 

<220> 

<221> promoter 

<222> (2257) . . (3074) 

<223> CMV IE promoter 

<220> 

<221> misc_recomb 

<222> (3078) . . (3102) 

<223> attBl 

<220> 

<221> CDS 

<222> (3115) . . (4041) 

<223> BLV Tax 

<220> 

<221> misc_recomb 

<222> (4046) . . (4070) 

<223> attB2 

<220> 

<221> misc_signal 

<222> (4082) . . (4674) 

<223> WPRE; woodchuck hepatitis virus post-transcriptional regulatory 
element 

<220> 

<221> LTR 

<222> (4720) . . (5313) 

<223> 3' MoMuLVLTR 

<400> 10 

gaattaattc ataccagatc accgaaaact gtcctccaaa tgtgtccccc tcacactccc 60 

aaattcgcgg gcttctgcct cttagaccac tctaccctat tccccacact caccggagcc 120 

aaagccgcgg cccttccgtt tctttgcttt tgaaagaccc cacccgtagg tggcaagcta 180 

gcttaagtaa cgccactttg caaggcatgg aaaaatacat aactgagaat agaaaagttc 240 

agatcaaggt caggaacaaa gaaacagctg aataccaaac aggatatctg tggtaagcgg 300 

ttcctgcccc ggctcagggc caagaacaga tgagacagct gagtgatggg ccaaacagga 360 

tatctgtggt aagcagttcc tgccccggct cggggccaag aacagatggt ccccagatgc 420 

ggtccagccc tcagcagttt ctagtgaatc atcagatgtt tccagggtgc cccaaggacc 4 80 

tgaaaatgac cctgtacctt atttgaacta accaatcagt tcgcttctcg cttctgttcg 540 

cgcgcttccg ctctccgagc tcaataaaag agcccacaac ccctcactcg gcgcgccagt 600 
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cttccgatag 


actgcgtcgc 


ccgggtaccc 


gtattcccaa 


taaagcctct 


tgctgtttgc 


660 


atccgaatcg 


tggtctcgct 


gttccttggg 


agggtctcct 


ctgagtgatt 


gactacccac 


720 


gacgggggtc 


tttcatttgg 


gggctcgtcc 


gggatttgga 


gacccctgcc 


cagggaccac 


780 


cgacccacca 


ccgggaggta 


agctggccag 


caacttatct 


gtgtctgtcc 


gattgtctag 


840 


tgtctatgtt 


tgatgttatg 


cgcctgcgtc 


tgtactagtt 


agctaactag 


ctctgtatct 


900 


ggcggacccg 


tggtggaact 


gacgagttct 


gaacacccgg 


ccgcaaccct 


gggagacgtc 


960 


ccagggactt 


tgggggccgt 


ttttgtggcc 


cgacctgagg 


aagggagtcg 


atgtggaatc 


1020 


cgaccccgtc 


aggatatgtg 


gttctggtag 


gagacgagaa 


cctaaaacag 


ttcccgcctc 


1080 


cgtctgaatt 


tttgctttcg 


gtttggaacc 


gaagccgcgc 


gtcttgtctg 


ctgcagcgct 


1140 


gcagcatcgt 


tctgtgttgt 


ctctgtctga 


ctgtgtttct 


gtatttgtct 


gaaaattagg 


1200 


gccagactgt 


taccactccc 


ttaagtttga 


ccttaggtca 


ctggaaagat 


gtcgagcgga 


1260 


tcgctcacaa 


ccagtcggta 


gatgtcaaga 


agagacgttg 


ggttaccttc 


tgctctgcag 


1320 


aatggccaac 


ctttaacgtc 


ggatggccgc 


gagacggcac 


ctttaaccga 


gacctcatca 


1380 


cccaggttaa 


gatcaaggtc 


ttttcacctg 


gcccgcatgg 


acacccagac 


caggtcccct 


1440 


acatcgtgac 


ctgggaagcc 


ttggcttttg 


acccccctcc 


ctgggtcaag 


ccctttgtac 


1500 


accctaagcc 


tccgcctcct 


cttcctccat 


ccgccccgtc 


tctccccctt 


gaacctcctc 


IDOU 


gttcgacccc 


gcctcgatcc 


tccctttatc 


cagccctcac 


tccttctcta 


ggcgccggaa 


1620 


ttccgatctg 


atcaagagac 


aggatgaggg 


agcttgtata 


tccattttcg 


gatctgatca 


1680 


gcacgtgttg 


acaattaatc 


atcggcatag 


tatatcggca 


tagtataata 


cgacaaggtg 


1740 



aggaactaaa cc atg gcc aag cct ttg tct caa gaa gaa tec acc etc att 1791 
Met Ala Lys Pro Leu Ser Gin Glu Glu Ser Thr Leu lie 
15 10 

gaa aga gca acg get aca ate aac age ate ccc ate tct gaa gac tac 1839 
Glu Arg Ala Thr Ala Thr lie Asn Ser lie Pro lie Ser Glu Asp Tyr 
15 20 25 

age gtc gcc age gca get etc tct age gac ggc cgc ate ttc act ggt 1887 
Ser Val Ala Ser Ala Ala Leu Ser Ser Asp Gly Arg lie Phe Thr Gly 
30 35 40 45 

gtc aat gta tat cat ttt act ggg gga cct tgt gca gaa etc gtg gtg 1935 
Val Asn Val Tyr His Phe Thr Gly Gly Pro Cys Ala Glu Leu Val Val 
50 55 60 

ctg ggc act get get get gcg gca get ggc aac ctg act tgt ate gtc 1983 
Leu Gly Thr Ala Ala Ala Ala Ala Ala Gly Asn Leu Thr Cys lie Val 
65 70 75 
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gcg ate gga aat gag aac agg ggc ate ttg age ccc tgc gga egg tgt 
Ala lie Gly Asn Glu Asn Arg Gly lie Leu Ser Pro Cys Gly Arg Cys 
80 85 90 



2031 



cga cag gtg ctt etc gat ctg cat cct ggg ate aaa gcg ata gtg aag 2079 
Arg Gin Val Leu Leu Asp Leu His Pro Gly lie Lys Ala lie Val Lys 
95 100 105 

gac agt gat gga cag ccg acg gca gtt ggg att cgt gaa ttg ctg ccc 2127 
Asp Ser Asp Gly Gin Pro Thr Ala Val Gly lie Arg Glu Leu Leu Pro 
110 115 120 125 

tct ggt tat gtg tgg gag ggc taagcacttc gtggccgagg agcaggactg 2178 
Ser Gly Tyr Val Trp Glu Gly 
130 



acaegtgeta 


cgagatttcg 


attccaccgc 


cgccttctat 


gaaaggttgg 


getteggaat 


2238 


cgttttccgg 


gacgccgatc 


eggecattag 


ccatattatt 


cattggttat 


atagcataaa 


2298 


tcaatattgg 


etattggeca 


ttgcatacgt 


tgtatccata 


tcataatatg 


tacatttata 


2358 


ttggctcatg 


tccaacatta 


ccgccatgtt 


gacattgatt 


attgactagt 


tattaatagt 


2418 


aatcaattac 


ggggtcatta 


gttcatagee 


catatatgga 


gttccgcgtt 


acataactta 


2478 


eggtaaatgg 


cccgcctggc 


tgaccgccca 


acgacccccg 


cccattgacg 


tcaataatga 


2538 


cgtatgttcc 


catagtaacg 


ccaataggga 


ctttccattg 


aegtcaatgg 


gtggagtatt 


2598 


tacggtaaac 


tgcccacttg 


gcagtacatc 


aagtgtatca 


tatgecaagt 


acgcccccta 


2658 


ttgacgtcaa 


tgacggtaaa 


tggcccgcct 


ggcattatgc 


ccagtacatg 


accttatggg 


2718 


actttcctac 


ttggcagtac 


atetaegtat 


tagtcatege 


tattaccatg 


gtgatgcggt 


2778 


tttggcagta 


catcaatggg 


cgtggatagc 


ggtttgactc 


aeggggattt 


ccaagtctcc 


2838 


accccattga 


cgtcaatggg 


agtttgtttt 


ggcaccaaaa 


teaaegggae 


tttccaaaat 


2898 


gtcgtaacaa 


ctccgcccca 


ttgacgcaaa 


tgggeggtag 


geatgtaegg 


tgggaggtct 


2958 


atataagcag 


agetegttta 


gtgaaccgtc 


agatcgectg 


gagacgecat 


ccacgctgtt 


3018 


ttgacctcca 


tagaagacac 


egggaccgat 


ccagcctccg 


cggccccaag 


cttgttatca 


3078 


caagtttgta 


caaaaaagca 


ggctcccgcc 


gccacc atg 
Met 


gca agt gtt 
Ala Ser Val 


gtt ggt 
Val Gly 


3132 



135 

tgg ggg ccc cac tct eta cat gee tgc ccg gee ctg gtt ttg tec aat 3180 
Trp Gly Pro His Ser Leu His Ala Cys Pro Ala Leu Val Leu Ser Asn 
140 145 150 

gat gtc acc ate gat gee tgg tgc ccc etc tgc ggg ccc cat gag cga 3228 
Asp Val Thr lie Asp Ala Trp Cys Pro Leu Cys Gly Pro His Glu Arg 
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155 160 165 170 

etc caa ttc gaa agg ate gac acc acg etc acc tgc gag ace cac cgt 3276 
Leu Gin Phe Glu Arg lie Asp Thr Thr Leu Thr Cys Glu Thr His Arg 
175 180 185 

ate aac tgg acc gee gat gga cga cct tgc ggc etc aat gga acg ttg 3324 
lie Asn Trp Thr Ala Asp Gly Arg Pro Cys Gly Leu Asn Gly Thr Leu 
190 195 200 

ttc cct cga ctg cat gtc tec gag acc cgc ccc caa ggg ccc cga cga 3372 
Phe Pro Arg Leu His Val Ser Glu Thr Arg Pro Gin Gly Pro Arg Arg 
205 210 215 

etc tgg ate aac tgc ccc ctt ccg gee gtt cgc get cag ccc ggc ccg 3420 
Leu Trp lie Asn Cys Pro Leu Pro Ala Val Arg Ala Gin Pro Gly Pro 
220 225 230 

gtt tea ctt tec ccc ttc gag egg tec ccc ttc cag ccc tac caa tgc 3468 
Val Ser Leu Ser Pro Phe Glu Arg Ser Pro Phe Gin Pro Tyr Gin Cys 
235 240 245 250 

caa ttg ccc teg gee tct age gac ggt tgc ccc att ate ggg cac ggc 3516 
Gin Leu Pro Ser Ala Ser Ser Asp Gly Cys Pro lie lie Gly His Gly 
255 260 265 

ctt ctt ccc tgg aac aac tta gta acg cat cct gtc etc aga aaa gtc 3564 
Leu Leu Pro Trp Asn Asn Leu Val Thr His Pro Val Leu Arg Lys Val 
270 275 280 

ctt ata tta aat caa atg gee aat ttt tec tta etc ccc tec ttc gat 3612 
Leu lie Leu Asn Gin Met Ala Asn Phe Ser Leu Leu Pro Ser Phe Asp 
285 290 295 

acc etc ctt gtg gac ccc etc egg ctg tec gtc ttt gee cca gac acc 3660 
Thr Leu Leu Val Asp Pro Leu Arg Leu Ser Val Phe Ala Pro Asp Thr 
300 305 310 

agg gga gee ata cgt tat etc tec acc ctt ttg acg eta tgc ccg get 3708 
Arg Gly Ala lie Arg Tyr Leu Ser Thr Leu Leu Thr Leu Cys Pro Ala 
315 320 325 330 

act tgt att eta ccc eta ggc gag ccc ttc tct cct aat gtc ccc ata 3756 
Thr Cys lie Leu Pro Leu Gly Glu Pro Phe Ser Pro Asn Val Pro lie 
335 340 345 

tgc cgc ttt ccc egg gac tec aat gaa ccc ccc ctt tea gaa ttc gag 3804 
Cys Arg Phe Pro Arg Asp Ser Asn Glu Pro Pro Leu Ser Glu Phe Glu 
350 355 360 

ctg ccc ctt ate caa acg ccc ggc ctg tct tgg tct gtc ccc gcg ate 3852 
Leu Pro Leu lie Gin Thr Pro Gly Leu Ser Trp Ser Val Pro Ala lie 
365 370 375 

gac eta ttc eta acc ggt ccc cct tec cca tgc gac egg tta cac gta 3900 
Asp Leu Phe Leu Thr Gly Pro Pro Ser Pro Cys Asp Arg Leu His Val 
380 385 390 
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tgg tec agt cct cag gec tta cag cgc ttc ctt cat gac cct acg eta 3948 
Trp Ser Ser Pro Gin Ala Leu Gin Arg Phe Leu His Asp Pro Thr Leu 
395 400 405 410 

acc tgg tec gaa tta gtt get age aga aaa ata aga ctt gat tec ccc 3996 
Thr Trp Ser Glu Leu Val Ala Ser Arg Lys lie Arg Leu Asp Ser Pro 
415 420 425 

tta aaa tta caa ctg eta gaa aat gaa tgg etc tec cgc ctt ttt 4041 
Leu Lys Leu Gin Leu Leu Glu Asn Glu Trp Leu Ser Arg Leu Phe 
430 435 440 



tgagacccag 


ctttcttgta 


caaagtggtg 


ataacatcga 


taatcaacct 


ctggattaca 


4101 


aaatttgtga 


aagattgact 


ggtattctta 


actatgttgc 


tccttttacg 


ctatgtggat 


4161 


aegctgettt 


aatgcctttg 


tatcatgeta 


ttgcttcccg 


tatggctttc 


attttctcct 


4221 


ccttgtataa 


atcctggttg 


ctgtctcttt 


atgaggagtt 


gtggcccgtt 


gtcaggcaac 


4281 


gtggcgtggt 


gtgcactgtg 


tttgctgacg 


caacccccac 


tggttggggc 


attgccacca 


4341 


cctgtcagct 


cctttccggg 


actttcgett 


tccccctccc 


tattgecacg 


geggaactea 


4401 


tcgccgcctg 


ccttgcccgc 


tgctggacag 


gggctegget 


gttgggcact 


gacaattccg 


4461 


tggtgttgtc 


ggggaaatca 


tcgtcctttc 


ettggctget 


cgcctgtgtt 


gccacctgga 


4521 


ttctgegegg 


gacgtccttc 


tgctacgtcc 


cttcggccct 


caatccagcg 


gaccttcctt 


4581 


cccgcggcct 


gctgccggct 


ctgcggcctc 


ttccgcgtct 


tcgccttcgc 


cctcagacga 


4641 


gteggatetc 


cctttgggcc 


gcctccccgc 


ctgatcgata 


aaataaaaga 


ttttatttag 


4701 


tctccagaaa 


aaggggggaa 


tgaaagaccc 


cacctgtagg 


tttggcaagc 


tagcttaagt 


4761 


aacgecattt 


tgcaaggcat 


ggaaaaatac 


ataactgaga 


atagagaagt 


tcagatcaag 


4821 


gtcaggaaca 


gatggaacag 


ctgaatatgg 


gecaaacagg 


atatctgtgg 


taagcagttc 


4881 


ctgccccggc 


teagggecaa 


gaacagatgg 


aacagctgaa 


tatgggccaa 


acaggatatc 


4941 


tgtggtaagc 


agttcctgcc 


ccggctcagg 


gecaagaaca 


gatggtcccc 


agatgeggtc 


5001 


cagccctcag 


cagtttctag 


agaaccatca 


gatgtttcca 


gggtgcccca 


aggacctgaa 


5061 


atgaccctgt 


gecttatttg 


aactaaccaa 


teagttcget 


tctcgcttct 


gttcgcgcgc 


5121 


ttctgctccc 


cgagctcaat 


aaaagagece 


acaacccctc 


acteggggeg 


ccagtcctcc 


5181 


gattgactga 


gtcgcccggg 


tacccgtgta 


tccaataaac 


cctcttgcag 


ttgeatcega 


5241 


cttgtggtct 


cgctgttcct 


tgggagggtc 


tcctctgagt 


gattgactac 


ccgtcagcgg 


5301 


gggtctttca 


tttttccatt 


gggggctcgt 


cegggategg 


gagacccctg 


cccagggacc 


5361 
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accgacccac 


caccgggagg 


taagctggct 


gcctcgcgcg 


tttcggtgat 


gacggtgaaa 


5421 


acctctgaca 


catgcagctc 


ccggagacgg 


tcacagcttg 


tctgtaagcg 


gatgccggga 


5481 


gcagacaagc 


ccgtcagggc 


gcgtcagcgg 


gtgttggcgg 


gtgtcggggc 


gcagccatga 


5541 


cccagtcacg 


tagcgatagc 


ggagtgtata 


ctggcttaac 


tatgcggcat 


cagagcagat 


5601 


tgtactgaga 


gtgcaccata 


tgcggtgtga 


aataccgcac 


agatgcgtaa 


ggagaaaata 


5661 


ccgcatcagg 


cgctcttccg 


cttcctcgct 


cactgactcg 


ctgcgctcgg 


tcgttcggct 


5721 


gcggcgagcg 


gtatcagctc 


actcaaaggc 


ggtaatacgg 


ttatccacag 


aatcagggga 


5781 


taacgcagga 


aagaacatgt 


gagcaaaagg 


ccagcaaaag 


gccaggaacc 


gtaaaaaggc 


5841 


cgcgttgctg 


gcgtttttcc 


ataggctccg 


cccccctgac 


gagcatcaca 


aaaatcgacg 


5901 


ctcaagtcag 


aggtggcgaa 


acccgacagg 


actataaaga 


taccaggcgt 


ttccccctgg 


5961 


aagctccctc 


gtgcgctctc 


ctgttccgac 


cctgccgctt 


accggatacc 


tgtccgcctt 


6021 


tctcccttcg 


ggaagcgtgg 


cgctttctca 


tagctcacgc 


tgtaggtatc 


tcagttcggt 


6081 


gtaggtcgtt 


cgctccaagc 


tgggctgtgt 


gcacgaaccc 


cccgttcagc 


ccgaccgctg 


6141 


cgccttatcc 


ggtaactatc 


gtcttgagtc 


caacccggta 


agacacgact 


tatcgccact 


6201 


ggcagcagcc 


actggtaaca 


ggattagcag 


agcgaggtat 


gtaggcggtg 


ctacagagtt 


6261 


cttgaagtgg 


tggcctaact 


acggctacac 


tagaaggaca 


gtatttggta 


tctgcgctct 


6321 


gctgaagcca 


gttaccttcg 


gaaaaagagt 


tggtagctct 


tgatccggca 


aacaaaccac 


6381 


cgctggtagc 


ggtggttttt 


ttgtttgcaa 


gcagcagatt 


acgcgcagaa 


aaaaaggatc 


6441 


tcaagaagat 


cctttgatct 


tttctacggg 


gtctgacgct 


cagtggaacg 


aaaactcacg 


6501 


ttaagggatt 


ttggtcatga 


gattatcaaa 


aaggatcttc 


acctagatcc 


ttttaaatta 


6561 


aaaatgaagt 


tttaaatcaa 


tctaaagtat 


atatgagtaa 


acttggtctg 


acagttacca 


6621 


atgcttaatc 


agtgaggcac 


ctatctcagc 


gatctgtcta 


tttcgttcat 


ccatagttgc 


6681 


ctgactcccc 


gtcgtgtaga 


taactacgat 


acgggagggc 


ttaccatctg 


gccccagtgc 


6741 


tgcaatgata 


ccgcgagacc 


cacgctcacc 


ggctccagat 


ttatcagcaa 


taaaccagcc 


6801 


agccggaagg 


gccgagcgca 


gaagtggtcc 


tgcaacttta 


tccgcctcca 


tccagtctat 


6861 


taattgttgc 


cgggaagcta 


gagtaagtag 


ttcgccagtt 


aatagtttgc 


gcaacgttgt 


6921 


tgccattgct 


gcaggcatcg 


tggtgtcacg 


ctcgtcgttt 


ggtatggctt 


cattcagctc 


6981 


cggttcccaa 


cgatcaaggc 


gagttacatg 


atcccccatg 


ttgtgcaaaa 


aagcggttag 


7041 


ctccttcggt 


cctccgatcg 


ttgtcagaag 


taagttggcc 


gcagtgttat 


cactcatggt 


7101 



tatggcagca 


ctgcataatt 


ctcttactgt 


catgccatcc 


gtaagatgct 


tttctgtgac 


7161 


tggtgagtac 


tcaaccaagt 


cattctgaga 


atagtgtatg 


cggcgaccga 


gttgctcttg 


7221 


cccggcgtca 


acacgggata 


ataccgcgcc 


acatagcaga 


actttaaaag 


tgctcatcat 


7281 


tggaaaacgt 


tcttcggggc 


gaaaactctc 


aaggatctta 


ccgctgttga 


gatccagttc 


7341 


gatgtaaccc 


actcgtgcac 


ccaactgatc 


ttcagcatct 


tttactttca 


ccagcgtttc 


7401 


tgggtgagca 


aaaacaggaa 


ggcaaaatgc 


cgcaaaaaag 


ggaataaggg 


cgacacggaa 


7461 


atgttgaata 


ctcatactct 


tcctttttca 


atattattga 


agcatttatc 


agggttattg 


7521 


tctcatgagc 


ggatacatat 


ttgaatgtat 


ttagaaaaat 


aaacaaatag 


gggttccgcg 


7581 


cacatttccc 


cgaaaagtgc 


cacctgacgt 


ctaagaaacc 


attattatca 


tgacattaac 


7641 


ctataaaaat 


aggcgtatca 


cgaggccctt 


tcgtcttcaa 


gaat 




7685 



<210> 11 

<211> 132 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 11 

Met Ala Lys Pro Leu Ser Gin Glu Glu Ser Thr Leu lie Glu Arg Ala 
15 10 15 

Thr Ala Thr lie Asn Ser lie Pro lie Ser Glu Asp Tyr Ser Val Ala 
20 25 30 

Ser Ala Ala Leu Ser Ser Asp Gly Arg lie Phe Thr Gly Val Asn Val 
35 40 45 

Tyr His Phe Thr Gly Gly Pro Cys Ala Glu Leu Val Val Leu Gly Thr 
50 55 60 

Ala Ala Ala Ala Ala Ala Gly Asn Leu Thr Cys lie Val Ala He Gly 
65 70 75 80 

Asn Glu Asn Arg Gly He Leu Ser Pro Cys Gly Arg Cys Arg Gin Val 
85 90 95 

Leu Leu Asp Leu His Pro Gly He Lys Ala He Val Lys Asp Ser Asp 

17 



100 



105 



110 



Gly Gin Pro Thr Ala Val Gly lie Arg Glu Leu Leu Pro Ser Gly Tyr 
115 120 125 



Val Trp Glu Gly 
130 



<210> 12 

<211> 309 

<212> PRT 

<213> Artificial 



<220> 

<223> Synthetic Construct 
<400> 12 



Met Ala Ser Val Val Gly Trp Gly Pro His Ser Leu His Ala Cys Pro 
15 10 15 



Ala Leu Val Leu Ser Asn Asp Val Thr lie Asp Ala Trp Cys Pro Leu 
20 25 30 



Cys Gly Pro His Glu Arg Leu Gin Phe Glu Arg lie Asp Thr Thr Leu 
35 40 45 



Thr Cys Glu Thr His Arg lie Asn Trp Thr Ala Asp Gly Arg Pro Cys 
50 55 60 



Gly Leu Asn Gly Thr Leu Phe Pro Arg Leu His Val Ser Glu Thr Arg 
65 70 75 80 



Pro Gin Gly Pro Arg Arg Leu Trp lie Asn Cys Pro Leu Pro Ala Val 
85 90 95 



Arg Ala Gin Pro Gly Pro Val Ser Leu Ser Pro Phe Glu Arg Ser Pro 
100 105 110 



Phe Gin Pro Tyr Gin Cys Gin Leu Pro Ser Ala Ser Ser Asp Gly Cys 
115 120 125 



Pro lie lie Gly His Gly Leu Leu Pro Trp Asn Asn Leu Val Thr His 
130 135 140 
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Pro Val Leu Arg Lys Val Leu lie Leu Asn Gin Met Ala Asn Phe Ser 
145 150 155 160 



Leu Leu Pro Ser Phe Asp Thr Leu Leu Val Asp Pro Leu Arg Leu Ser 
165 170 175 



Val Phe Ala Pro Asp Thr Arg Gly Ala lie Arg Tyr Leu Ser Thr Leu 
180 185 190 



Leu Thr Leu Cys Pro Ala Thr Cys lie Leu Pro Leu Gly Glu Pro Phe 
195 200 205 



Ser Pro Asn Val Pro lie Cys Arg Phe Pro Arg Asp Ser Asn Glu Pro 
210 215 220 



Pro Leu Ser Glu Phe Glu Leu Pro Leu lie Gin Thr Pro Gly Leu Ser 
225 230 235 240 



Trp Ser Val Pro Ala lie Asp Leu Phe Leu Thr Gly Pro Pro Ser Pro 
245 250 255 



Cys Asp Arg Leu His Val Trp Ser Ser Pro Gin Ala Leu Gin Arg Phe 
260 265 270 



Leu His Asp Pro Thr Leu Thr Trp Ser Glu Leu Val Ala Ser Arg Lys 
275 280 285 



lie Arg Leu Asp Ser Pro Leu Lys Leu Gin Leu Leu Glu Asn Glu Trp 
290 295 300 



Leu Ser Arg Leu Phe 
305 



<210> 13 

<211> 7685 

<212> DNA 

<213> Artificial 

<220> 

<223> Engineered sequence form virus and plasmid 



<220> 

<221> CDS 

<222> (210) . . (1070) 

<223> Anipicillin resistance 
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<400> 13 

attcttgaag acgaaagggc ctcgtgatac gcctattttt ataggttaat gtcatgataa 60 



taatggtttc ttagacgtca ggtggcactt ttcggggaaa tgtgcgcgga acccctattt 120 

gtttattttt ctaaatacat tcaaatatgt atccgctcat gagacaataa ccctgataaa 180 

tgcttcaata atattgaaaa aggaagagt atg agt att caa cat ttc cgt gtc 233 

Met Ser He Gin His Phe Arg Val 
1 5 

gcc ctt att ccc ttt ttt gcg gca ttt tgc ctt cct gtt ttt get cac 281 

Ala Leu He Pro Phe Phe Ala Ala Phe Cys Leu Pro Val Phe Ala His 
10 15 20 

cca gaa acg ctg gtg aaa gta aaa gat get gaa gat cag ttg ggt gca 329 
Pro Glu Thr Leu Val Lys Val Lys Asp Ala Glu Asp Gin Leu Gly Ala 
25 30 35 40 

cga gtg ggt tac ate gaa ctg gat etc aac age ggt aag ate ctt gag 377 
Arg Val Gly Tyr He Glu Leu Asp Leu Asn Ser Gly Lys He Leu Glu 
45 50 55 

agt ttt cgc ccc gaa gaa cgt ttt cca atg atg age act ttt aaa gtt 42 5 

Ser Phe Arg Pro Glu Glu Arg Phe Pro Met Met Ser Thr Phe Lys Val 
60 65 70 

ctg eta tgt ggc gcg gta tta tec cgt gtt gac gcc ggg caa gag caa 473 
Leu Leu Cys Gly Ala Val Leu Ser Arg Val Asp Ala Gly Gin Glu Gin 
75 80 85 

etc ggt cgc cgc ata cac tat tct cag aat gac ttg gtt gag tac tea 521 
Leu Gly Arg Arg He His Tyr Ser Gin Asn Asp Leu Val Glu Tyr Ser 
90 95 100 

cca gtc aca gaa aag cat ctt acg gat ggc atg aca gta aga gaa tta 569 
Pro Val Thr Glu Lys His Leu Thr Asp Gly Met Thr Val Arg Glu Leu 
105 110 115 120 

tgc agt get gcc ata ace atg agt gat aac act gcg gcc aac tta ctt 617 
Cys Ser Ala Ala He Thr Met Ser Asp Asn Thr Ala Ala Asn Leu Leu 
125 130 135 

ctg aca acg ate gga gga ccg aag gag eta ace get ttt ttg cac aac 665 
Leu Thr Thr lie Gly Gly Pro Lys Glu Leu Thr Ala Phe Leu His Asn 
140 145 150 

atg ggg gat cat gta act cgc ctt gat cgt tgg gaa ccg gag ctg aat 713 
Met Gly Asp His Val Thr Arg Leu Asp Arg Trp Glu Pro Glu Leu Asn 
155 160 165 

gaa gcc ata cca aac gac gag cgt gac ace acg atg cct gca gca atg 761 
Glu Ala He Pro Asn Asp Glu Arg Asp Thr Thr Met Pro Ala Ala Met 
170 175 180 

gca aca acg ttg cgc aaa eta tta act ggc gaa eta ctt act eta get 809 
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Ala Thr Thr Leu Arg Lys Leu Leu Thr Gly Glu Leu Leu Thr Leu Ala 
185 190 195 200 



tec egg caa caa tta ata gac tgg atg gag gcg gat aaa gtt gca gga 857 
Ser Arg Gin Gin Leu lie Asp Trp Met Glu Ala Asp Lys Val Ala Gly 
205 210 215 

cca ctt ctg cgc teg gee ctt ccg get ggc tgg ttt att get gat aaa 905 
Pro Leu Leu Arg Ser Ala Leu Pro Ala Gly Trp Phe lie Ala Asp Lys 
220 225 230 

tct gga gee ggt gag cgt ggg tct cgc ggt ate att gca gca ctg ggg 953 
Ser Gly Ala Gly Glu Arg Gly Ser Arg Gly lie lie Ala Ala Leu Gly 
235 240 245 

cca gat ggt aag ccc tec cgt ate gta gtt ate tac acg acg ggg agt 1001 
Pro Asp Gly Lys Pro Ser Arg lie Val Val lie Tyr Thr Thr Gly Ser 
250 255 260 

cag gca act atg gat gaa cga aat aga cag ate get gag ata ggt gee 1049 
Gin Ala Thr Met Asp Glu Arg Asn Arg Gin lie Ala Glu lie Gly Ala 
265 270 275 280 

tea ctg att aag cat tgg taa ctgtcagacc aagtttactc atatatactt 1100 
Ser Leu lie Lys His Trp 
285 

tagattgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1160 

aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagegtc agaccccgta 1220 

gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1280 

acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1340 

tttccgaagg taactggctt cagcagagcg cagataccaa atactgtcct tctagtgtag 1400 

ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1460 

atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttacegg gttggactca 1520 

agacgatagt taceggataa ggegcagegg tegggctgaa eggggggtte gtgcacacag 1580 

cccagcttgg agegaacgae ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1640 

agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc eggtaagegg cagggtegga 1700 

acaggagagc gcacgaggga gcttccaggg ggaaacgect ggtatcttta tagtcctgtc 1760 

gggtttcgcc acctctgact tgagegtega tttttgtgat getegtcagg ggggeggage 1820 

ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1880 

gctcacatgt tctttcctgc gttatcccct gattctgtgg ataacegtat taccgccttt 1940 

gagtgagctg ataccgctcg ccgcagccga acgaccgagc geagegagtc agtgagcgag 2 000 
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gaagcggaag 


agcgcctgat 


gcggtatttt 


ctccttacgc 


atctgtgcgg 


tatttcacac 


2060 


cgcatatggt 


gcactctcag 


tacaatctgc 


tctgatgccg 


catagttaag 


ccagtataca 


2120 


ctccgctatc 


gctacgtgac 


tgggtcatgg 


ctgcgccccg 


acacccgcca 


acacccgctg 


2180 


acgcgccctg 


acgggcttgt 


ctgctcccgg 


catccgctta 


cagacaagct 


gtgaccgtct 


2240 


ccgggagctg 


catgtgtcag 


aggttttcac 


cgtcatcacc 


gaaacgcgcg 


aggcagccag 


2300 


cttacctccc 


ggtggtgggt 


cggtggtccc 


tgggcagggg 


tctcccgatc 


ccggacgagc 


2360 


ccccaatgga 


aaaatgaaag 


acccccgctg 


acgggtagtc 


aatcactcag 


aggagaccct 


2420 


cccaaggaac 


agcgagacca 


caagtcggat 


gcaactgcaa 


gagggtttat 


tggatacacg 


2480 


ggtacccggg 


cgactcagtc 


aatcggagga 


ctggcgcccc 


gagtgagggg 


ttgtgggctc 


2540 


ttttattgag 


ctcggggagc 


agaagcgcgc 


gaacagaagc 


gagaagcgaa 


ctgattggtt 


2600 


agttcaaata 


aggcacaggg 


tcatttcagg 


tccttggggc 


accctggaaa 


catctgatgg 


2660 


ttctctagaa 


actgctgagg 


gctggaccgc 


atctggggac 


catctgttct 


tggccctgag 


2720 


ccggggcagg 


aactgcttac 


cacagatatc 


ctgtttggcc 


catattcagc 


tgttccatct 


2780 


gttcttggcc 


ctgagccggg 


gcaggaactg 


cttaccacag 


atatcctgtt 


tggcccatat 


2840 


tcagctgttc 


catctgttcc 


tgaccttgat 


ctgaacttct 


ctattctcag 


ttatgtattt 


2900 


ttccatgcct 


tgcaaaatgg 


cgttacttaa 


gctagcttgc 


caaacctaca 


ggtggggtct 


2960 


ttcattcccc 


cctttttctg 


gagactaaat 


aaaatctttt 


attttatcga 


tcaggcgggg 


3020 


aggcggccca 


aagggagatc 


cgactcgtct 


gagggcgaag 


gcgaagacgc 


ggaagaggcc 


3080 


gcagagccgg 


cagcaggccg 


cgggaaggaa 


ggtccgctgg 


attgagggcc 


gaagggacgt 


3140 


agcagaagga 


cgtcccgcgc 


agaatccagg 


tggcaacaca 


ggcgagcagc 


caaggaaagg 


3200 


acgatgattt 


ccccgacaac 


accacggaat 


tgtcagtgcc 


caacagccga 


gcccctgtcc 


3260 


agcagcgggc 


aaggcaggcg 


gcgatgagtt 


ccgccgtggc 


aatagggagg 


gggaaagcga 


3320 


aagtcccgga 


aaggagctga 


caggtggtgg 


caatgcccca 


accagtgggg 


gttgcgtcag 


3380 


caaacacagt 


gcacaccacg 


ccacgttgcc 


tgacaacggg 


ccacaactcc 


tcataaagag 


3440 


acagcaacca 


ggatttatac 


aaggaggaga 


aaatgaaagc 


catacgggaa 


gcaatagcat 


3500 


gatacaaagg 


cattaaagca 


gcgtatccac 


atagcgtaaa 


aggagcaaca 


tagttaagaa 


3560 


taccagtcaa 


tctttcacaa 


attttgtaat 


ccagaggttg 


attatcgatg 


ttatcaccac 


3620 


tttgtacaag 


aaagctgggt 


ctcaaaaaag 


gcgggagagc 


cattcatttt 


ctagcagttg 


3680 


taattttaag 


ggggaatcaa 


gtcttatttt 


tctgctagca 


actaattcgg 


accaggttag 


3740 



cgtagggtca 


tgaaggaagc 


gctgtaaggc 


ctgaggactg 


gaccatacgt 


gtaaccggtc 


3800 


gcatggggaa 


gggggaccgg 


ttaggaatag 


gtcgatcgcg 


gggacagacc 


aagacaggcc 


3860 


gggcgtttgg 


ataaggggca 


gctcgaattc 


tgaaaggggg 


ggttcattgg 


agtcccgggg 


3920 


aaagcggcat 


atggggacat 


taggagagaa 


gggctcgcct 


aggggtagaa 


tacaagtagc 


3980 


cgggcatagc 


gtcaaaaggg 


tggagagata 


acgtatggct 


cccctggtgt 


ctggggcaaa 


4040 


gacggacagc 


cggagggggt 


ccacaaggag 


ggtatcgaag 


gaggggagta 


aggaaaaatt 


4100 


ggccatttga 


tttaatataa 


ggacttttct 


gaggacagga 


tgcgttacta 


agttgttcca 


4160 


gggaagaagg 


ccgtgcccga 


taatggggca 


accgtcgcta 


gaggccgagg 


gcaattggca 


4220 


ttggtagggc 


tggaaggggg 


accgctcgaa 


gggggaaagt 


gaaaccgggc 


cgggctgagc 


4280 


gcgaacggcc 


ggaagggggc 


agttgatcca 


gagtcgtcgg 


ggcccttggg 


ggcgggtctc 


4340 


ggagacatgc 


agtcgaggga 


acaacgttcc 


attgaggccg 


caaggtcgtc 


catcggcggt 


4400 


ccagttgata 


cggtgggtct 


cgcaggtgag 


cgtggtgtcg 


atcctttcga 


attggagtcg 


4460 


ctcatggggc 


ccgcagaggg 


ggcaccaggc 


atcgatggtg 


acatcattgg 


acaaaaccag 


4520 


ggccgggcag 


gcatgtagag 


agtggggccc 


ccaaccaaca 


acacttgcca 


tggtggcggc 


4580 


gggagcctgc 


ttttttgtac 


aaacttgtga 


taacaagctt 


ggggccgcgg 


aggctggatc 


4640 


ggtcccggtg 


tcttctatgg 


aggtcaaaac 


agcgtggatg 


gcgtctccag 


gcgatctgac 


4700 


ggttcactaa 


acgagctctg 


cttatataga 


cctcccaccg 


tacatgccta 


ccgcccattt 


4760 


gcgtcaatgg 


ggcggagttg 


ttacgacatt 


ttggaaagtc 


ccgttgattt 


tggtgccaaa 


4820 


acaaactccc 


attgacgtca 


atggggtgga 


gacttggaaa 


tccccgtgag 


tcaaaccgct 


4880 


atccacgccc 


attgatgtac 


tgccaaaacc 


gcatcaccat 


ggtaatagcg 


atgactaata 


4940 


cgtagatgta 


ctgccaagta 


ggaaagtccc 


ataaggtcat 


gtactgggca 


taatgccagg 


5000 


cgggccattt 


accgtcattg 


acgtcaatag 


ggggcgtact 


tggcatatga 


tacacttgat 


5060 


gtactgccaa 


gtgggcagtt 


taccgtaaat 


actccaccca 


ttgacgtcaa 


tggaaagtcc 


5120 


ctattggcgt 


tactatggga 


acatacgtca 


ttattgacgt 


caatgggcgg 


gggtcgttgg 


5180 


gcggtcagcc 


aggcgggcca 


tttaccgtaa 


gttatgtaac 


gcggaactcc 


atatatgggc 


5240 


tatgaactaa 


tgaccccgta 


attgattact 


attaataact 


agtcaataat 


caatgtcaac 


5300 


atggcggtaa 


tgttggacat 


gagccaatat 


aaatgtacat 


attatgatat 


ggatacaacg 


5360 


tatgcaatgg 


ccaatagcca 


atattgattt 


atgctatata 


accaatgaat 


aatatggcta 


5420 



atggccggat 


cggcgtcccg 


gaaaacgatt 


gtggaatcga 


aatctcgtag 


cacgtgtcag 


ctcccacaca 


taaccagagg 


gcagcaattc 


actgtccttc 


actatcgctt 


tgatcccagg 


tccgcagggg 


ctcaagatgc 


ccctgttctc 


gccagctgcc 


gcagcagcag 


cagtgcccag 


aaaatgatat 


acattgacac 


cagtgaagat 


gacgctgtag 


tcttcagaga 


tggggatgct 


ggtggattct 


tcttgagaca 


aaggcttggc 


tactatgccg 


atatactatg 


ccgatgatta 


atggatatac 


aagctccctc 


atcctgtctc 


aaggagtgag 


ggctggataa 


agggaggatc 


ggagagacgg 


ggcggatgga 


ggaagaggag 


cccagggagg 


ggggtcaaaa 


gccaaggctt 


ggtgtccatg 


cgggccaggt 


gaaaagacct 


taaaggtgcc 


gtctcgcggc 


catccgacgt 


taacccaacg 


tctcttcttg 


acatctaccg 


tccagtgacc 


taaggtcaaa 


cttaagggag 


aatacagaaa 


cacagtcaga 


cagagacaac 


aagacgcgcg 


gcttcggttc 


caaaccgaaa 


ttaggttctc 


gtctcctacc 


agaaccacat 


cccttcctca 


ggtcgggcca 


caaaaacggc 


tgcggccggg 


tgttcagaac 


tcgtcagttc 


tagctaacta 


gtacagacgc 


aggcgcataa 


gacacagata 


aattactQQc 


cagcttacct 


gggtctccaa 


atcccggacg 


agcccccaaa 


ctcagaggag 


accctcccaa 


ggaacagcga 


ctttattggg 


aatacgggta 


cccgggcgac 


gaggggttgt 


gggctctttt 


attgagctcg 



ccgaagccca acctttcata gaaggcggcg 5480 

tcctgctcct cggccacgaa gtgcttagcc 5540 

acgaatccca actgccgtcg gctgtccatc 5600 

atgcagatcg agaagcacct gtcgacaccg 5660 

atttccgatc gcgacgatac aagtcaggtt 5720 

caccacgagt tctgcacaag gtcccccagt 57 80 

gcggccgtcg ctagagagag ctgcgctggc 5840 

gttgattgta gccgttgctc tttcaatgag 5900 

catggtttag ttcctcacct tgtcgtatta 5960 

attgtcaaca cgtgctgatc agatccgaaa 6020 

ttgatcagat cggaattccg gcgcctagag 6080 

gaggcggggt cgaacgagga ggttcaaggg 6140 

gcggaggctt agggtgtaca aagggcttga 6200 

cccaggtcac gatgtagggg acctggtctg 6260 

tgatcttaac ctgggtgatg aggtctcggt 6320 

taaaggttgg ccattctgca gagcagaagg 6380 

actggttgtg agcgatccgc tcgacatctt 6440 

tggtaacagt ctggccctaa ttttcagaca 6500 

acagaacgat gctgcagcgc tgcagcagac 6560 

gcaaaaattc agacggaggc gggaactgtt 6620 

atcctgacgg ggtcggattc cacatcgact 6680 

ccccaaagtc cctgggacgt ctcccagggt 6740 

caccacgggt ccgccagata cagagctagt 6800 

catcaaacat agacactaga caatcggaca 6860 

cccggtggtg ggtcggtggt ccctgggcag 6920 

tgaaagaccc ccgtcgtggg tagtcaatca 6980 

gaccacgatt cggatgcaaa cagcaagagg 7040 

gcagtctatc ggaagactgg cgcgccgagt 7100 

gagagcggaa gcgcgcgaac agaagcgaga 7160 



24 



agcgaactga 


ttggttagtt 


caaataaggt 


acagggtcat 


tttcaggtcc 


ttggggcacc 


7220 


ctggaaacat 


ctgatgattc 


actagaaact 


gctgagggct 


ggaccgcatc 


tggggaccat 


7280 


ctgttcttgg 


ccccgagccg 


gggcaggaac 


tgcttaccac 


agatatcctg 


tttggcccat 


7340 


cactcagctg 


tctcatctgt 


tcttggccct 


gagccggggc 


aggaaccgct 


taccacagat 


7400 


atcctgtttg 


gtattcagct 


gtttctttgt 


tcctgacctt 


gatctgaact 


tttctattct 


7460 


cagttatgta 


tttttccatg 


ccttgcaaag 


tggcgttact 


taagctagct 


tgccacctac 


7520 


gggtggggtc 


tttcaaaagc 


aaagaaacgg 


aagggccgcg 


gctttggctc 


cggtgagtgt 


7580 


ggggaatagg 


gtagagtggt 


ctaagaggca 


gaagcccgcg 


aatttgggag 


tgtgaggggg 


7640 


acacatttgg 


aggacagttt 


tcggtgatct 


ggtatgaatt 


aattc 




7685 



<210> 14 

<211> 286 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 14 

Met Ser lie Gin His Phe Arg Val Ala Leu lie Pro Phe Phe Ala Ala 
15 10 15 



Phe Cys Leu Pro Val Phe Ala His Pro Glu Thr Leu Val Lys Val Lys 
20 25 30 



Asp Ala Glu Asp Gin Leu Gly Ala Arg Val Gly Tyr lie Glu Leu Asp 
35 40 45 



Leu Asn Ser Gly Lys lie Leu Glu Ser Phe Arg Pro Glu Glu Arg Phe 
50 55 60 



Pro Met Met Ser Thr Phe Lys Val Leu Leu Cys Gly Ala Val Leu Ser 
65 70 75 80 



Arg Val Asp Ala Gly Gin Glu Gin Leu Gly Arg Arg lie His Tyr Ser 
85 90 95 



Gin Asn Asp Leu Val Glu Tyr Ser Pro Val Thr Glu Lys His Leu Thr 
100 105 110 
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Asp Gly Met Thr Val Arg Glu Leu Cys Ser Ala Ala lie Thr Met Ser 
115 120 125 



Asp Asn Thr Ala Ala Asn Leu Leu Leu Thr Thr lie Gly Gly Pro Lys 
130 135 140 



Glu Leu Thr Ala Phe Leu His Asn Met Gly Asp His Val Thr Arg Leu 
145 150 155 160 



Asp Arg Trp Glu Pro Glu Leu Asn Glu Ala lie Pro Asn Asp Glu Arg 
165 170 175 



Asp Thr Thr Met Pro Ala Ala Met Ala Thr Thr Leu Arg Lys Leu Leu 
180 185 190 



Thr Gly Glu Leu Leu Thr Leu Ala Ser Arg Gin Gin Leu lie Asp Trp 
195 200 205 



Met Glu Ala Asp Lys Val Ala Gly Pro Leu Leu Arg Ser Ala Leu Pro 
210 215 220 



Ala Gly Trp Phe lie Ala Asp Lys Ser Gly Ala Gly Glu Arg Gly Ser 
225 230 235 240 



Arg Gly lie lie Ala Ala Leu Gly Pro Asp Gly Lys Pro Ser Arg lie 
245 250 255 



Val Val lie Tyr Thr Thr Gly Ser Gin Ala Thr Met Asp Glu Arg Asn 
260 265 270 



Arg Gin lie Ala Glu lie Gly Ala Ser Leu lie Lys His Trp 
275 280 285 



<210> 15 

<211> 7428 

<212> DNA 

<213> Artificial 

<220> 

<223> Engineered sequence from virus and plasmid 



<220> 

<221> LTR 

<222> (149) . . (737) 
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<223> 



MoMuSVLTR 



<220> 

<221> CDS 

<222> (1660) . . (2454) 

<223> Neomycin resistance 

<220> 

<2 2 1 > mi sc_f eature 

<222> (3170) (3194) 

<223> attBl 

<220> 

<221> CDS 

<222> (3267) . . (3737) 

<223> trans-dominant BLV Rex (M4) 

<220> 

<2 2 1> mi sc_f eature 

<222> (3800) . . (3824) 

<223> attB2 

<220> 

<221> misc_signal 

<222> (3837) . . (4428) 

<223> WPRE; woodchuck hepatitis virus post-transcriptional regulatory 
element 



<220> 

<221> LTR 

<222> (4474) . . (5067) 
<223> 3' MoMuLVLTR 












<400> 15 
gaattaattc 


ataccagatc 


accgaaaact 


gtcctccaaa 


tgtgtccccc 


tcacactccc 


60 


aaattcgcgg 


gcttctgcct 


cttagaccac 


tctaccctat 


tccccacact 


caccggagcc 


120 


aaagccgcgg 


cccttccgtt 


tctttgcttt 


tgaaagaccc 


cacccgtagg 


tggcaagcta 


180 


gcttaagtaa 


cgccactttg 


caaggcatgg 


aaaaatacat 


aactgagaat 


agaaaagttc 


240 


agatcaaggt 


caggaacaaa 


gaaacagctg 


aataccaaac 


aggatatctg 


tggtaagcgg 


300 


ttcctgcccc 


ggctcagggc 


caagaacaga 


tgagacagct 


gagtgatggg 


ccaaacagga 


360 


tatctgtggt 


aagcagttcc 


tgccccggct 


cggggccaag 


aacagatggt 


ccccagatgc 


420 


ggtccagccc 


tcagcagttt 


ctagtgaatc 


atcagatgtt 


tccagggtgc 


cccaaggacc 


480 


tgaaaatgac 


cctgtacctt 


atttgaacta 


accaatcagt 


tcgcttctcg 


cttctgttcg 


540 


cgcgcttccg 


ctctccgagc 


tcaataaaag 


agcccacaac 


ccctcactcg 


gcgcgccagt 


600 


cttccgatag 


actgcgtcgc 


ccgggtaccc 


gtattcccaa 


taaagcctct 


tgctgtttgc 


660 


atccgaatcg 


tggtctcgct 


gttccttggg 


agggtctcct 


ctgagtgatt 


gactacccac 


720 
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gacgggggtc tttcatttgg gggctcgtcc gggatttgga gacccctgcc cagggaccac 780 

cgacccacca ccgggaggta agctggccag caacttatct gtgtctgtcc gattgtctag 840 

tgtctatgtt tgatgttatg cgcctgcgtc tgtactagtt agctaactag ctctgtatct 900 

ggcggacccg tggtggaact gacgagttct gaacacccgg ccgcaaccct gggagacgtc 960 

ccagggactt tgggggccgt ttttgtggcc cgacctgagg aagggagtcg atgtggaatc 1020 

cgaccccgtc aggatatgtg gttctggtag gagacgagaa cctaaaacag ttcccgcctc 1080 

cgtctgaatt tttgctttcg gtttggaacc gaagccgcgc gtcttgtctg ctgcagcgct 1140 

gcagcatcgt tctgtgttgt ctctgtctga ctgtgtttct gtatttgtct gaaaattagg 1200 

gccagactgt taccactccc ttaagtttga ccttaggtca ctggaaagat gtcgagcgga 1260 

tcgctcacaa ccagtcggta gatgtcaaga agagacgttg ggttaccttc tgctctgcag 1320 

aatggccaac ctttaacgtc ggatggccgc gagacggcac ctttaaccga gacctcatca 1380 

cccaggttaa gatcaaggtc ttttcacctg gcccgcatgg acacccagac caggtcccct 1440 

acatcgtgac ctgggaagcc ttggcttttg acccccctcc ctgggtcaag ccctttgtac 1500 

accctaagcc tccgcctcct cttcctccat ccgccccgtc tctccccctt gaacctcctc 1560 

gttcgacccc gcctcgatcc tccctttatc cagccctcac tccttctcta ggcgccggaa 1620 

ttccgatctg atcaagagac aggatgagga tcgtttcgc atg att gaa caa gat 1674 

Met lie Glu Gin Asp 
1 5 

gga ttg cac gca ggt tct ccg gcc get tgg gtg gag agg eta ttc ggc 1722 
Gly Leu His Ala Gly Ser Pro Ala Ala Trp Val Glu Arg Leu Phe Gly 
10 15 20 

tat gac tgg gca caa cag aca ate ggc tgc tct gat gcc gcc gtg ttc 1770 
Tyr Asp Trp Ala Gin Gin Thr lie Gly Cys Ser Asp Ala Ala Val Phe 
25 30 35 

egg ctg tea gcg cag ggg cgc ccg gtt ctt ttt gtc aag ace gac ctg 1818 
Arg Leu Ser Ala Gin Gly Arg Pro Val Leu Phe Val Lys Thr Asp Leu 
40 45 50 

tec ggt gcc ctg aat gaa ctg cag gac gag gca gcg egg eta teg tgg 1866 
Ser Gly Ala Leu Asn Glu Leu Gin Asp Glu Ala Ala Arg Leu Ser Trp 
55 60 65 

ctg gcc acg acg ggc gtt cct tgc gca get gtg etc gac gtt gtc act 1914 
Leu Ala Thr Thr Gly Val Pro Cys Ala Ala Val Leu Asp Val Val Thr 
70 75 80 85 

gaa gcg gga agg gac tgg ctg eta ttg ggc gaa gtg ccg ggg cag gat 1962 
Glu Ala Gly Arg Asp Trp Leu Leu Leu Gly Glu Val Pro Gly Gin Asp 
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90 



95 



100 



etc ctg tea tct cac ctt get cct gec gag aaa gta tec ate atg get 2010 
Leu Leu Ser Ser His Leu Ala Pro Ala Glu Lys Val Ser lie Met Ala 
105 110 115 

gat gca atg egg egg ctg cat acg ctt gat ccg get acc tgc cca ttc 2058 
Asp Ala Met Arg Arg Leu His Thr Leu Asp Pro Ala Thr Cys Pro Phe 
120 125 130 

gac cac caa gcg aaa cat cgc ate gag cga gca cgt act egg atg gaa 2106 
Asp His Gin Ala Lys His Arg lie Glu Arg Ala Arg Thr Arg Met Glu 
135 140 145 

gec ggt ctt gtc gat cag gat gat ctg gac gaa gag cat cag ggg etc 2154 
Ala Gly Leu Val Asp Gin Asp Asp Leu Asp Glu Glu His Gin Gly Leu 
150 155 160 165 

gcg cca gec gaa ctg ttc gee agg etc aag gcg cgc atg ccc gac ggc 2202 
Ala Pro Ala Glu Leu Phe Ala Arg Leu Lys Ala Arg Met Pro Asp Gly 
170 175 180 

gag gat etc gtc gtg acc cat ggc gat gee tgc ttg ccg aat ate atg 2250 
Glu Asp Leu Val Val Thr His Gly Asp Ala Cys Leu Pro Asn lie Met 
185 190 195 

gtg gaa aat ggc cgc ttt tct gga ttc ate gac tgt ggc egg ctg ggt 2298 
Val Glu Asn Gly Arg Phe Ser Gly Phe lie Asp Cys Gly Arg Leu Gly 
200 205 210 

gtg gcg gac cgc tat cag gac ata gcg ttg get acc cgt gat att get 2346 
Val Ala Asp Arg Tyr Gin Asp lie Ala Leu Ala Thr Arg Asp lie Ala 
215 220 225 

gaa gag ctt ggc ggc gaa tgg get gac cgc ttc etc gtg ctt tac ggt 2394 
Glu Glu Leu Gly Gly Glu Trp Ala Asp Arg Phe Leu Val Leu Tyr Gly 
230 235 240 245 

ate gee get ccc gat teg cag cgc ate gec ttc tat cgc ctt ctt gac 2442 
lie Ala Ala Pro Asp Ser Gin Arg lie Ala Phe Tyr Arg Leu Leu Asp 
250 255 260 

gag ttc ttc tga gegggactet ggggttcgaa atgaccgacc aagcgacgcc 2494 
Glu Phe Phe 



caacctgcca 


tcacgagatt 


tcgattccac 


cgccgccttc 


tatgaaaggt 


tgggcttegg 


2554 


aatcgttttc 


cgggacgccg 


gctggatgat 


cctccagcgc 


ggggatctca 


tgctggagtt 


2614 


cttcgcccac 


cccgggctcg 


atcccctcgc 


gagttggttc 


agctgctgcc 


tgaggctgga 


2674 


cgacctcgcg 


gagttctacc 


ggcagtgcaa 


atccgtcggc 


atccaggaaa 


ccagcagcgg 


2734 


ctatccgcgc 


atccatgccc 


ccgaactgca 


ggagtgggga 


ggcacgatgg 


ccgctttggt 


2794 


egaggeggat 


cctagcagaa 


aaataagact 


tgattccccc 


ttaaaattac 


aactgetaga 


2854 
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aaatgaatgg ctctcccgcc ttttttgagg gggaatcatt tgtatgaaag atcatgccga 2914 

cctaggcgcc gccaccgccc cgtaaaccag acagagacgt cagctgccag aaaagctggt 2974 

gacggcagct ggtggctaga atccccgtac ctccccaact tcccctttcc cgaaaaatcc 3034 

acaccctgag ctgctgacct cacctgctga taaattaata aaatgccggc cctgtcgagt 3094 

tagcggcacc agaagcgttc ttctcctgag accctcgtgc tcagctctcg gtcctgcctc 3154 

gagaagcttg ttatcacaag tttgtacaaa aaagcaggct tcgaaggaga tagaaccaat 3214 

tctctaagga aatacttaac gtcgactgga tccggtaccg aattcgatcc ac atg cct 3272 

Met Pro 
265 

aaa aaa cga egg tec cga aga cgc cca caa ccg ate ate aga tgg caa 3320 
Lys Lys Arg Arg Ser Arg Arg Arg Pro Gin Pro lie lie Arg Trp Gin 
270 275 280 

gtg ttg ttg gtt ggg ggc ccc act etc tac atg cct gec egg ccc tgg 3368 
Val Leu Leu Val Gly Gly Pro Thr Leu Tyr Met Pro Ala Arg Pro Trp 
285 290 295 

ttt tgt cca atg atg tea cca teg atg cct ggt gec ccc tct gcg ggc 3416 
Phe Cys Pro Met Met Ser Pro Ser Met Pro Gly Ala Pro Ser Ala Gly 
300 305 310 

ccc atg age gac tec aat teg aaa gga teg aca cca cgc tea cct gcg 3464 
Pro Met Ser Asp Ser Asn Ser Lys Gly Ser Thr Pro Arg Ser Pro Ala 
315 320 325 330 

aga ccc ace gta tea act gga ccg ccg atg gac gac ctt gcg gee tea 3512 
Arg Pro Thr Val Ser Thr Gly Pro Pro Met Asp Asp Leu Ala Ala Ser 
335 340 345 

atg gaa cgt tgt tec etc gac tgc atg tct ccg aga ccc gee ccc aag 3560 
Met Glu Arg Cys Ser Leu Asp Cys Met Ser Pro Arg Pro Ala Pro Lys 
350 355 360 

ggc ccc gac gac tct gga tea act gee ccc ttc egg ccg ttc gcg etc 3608 
Gly Pro Asp Asp Ser Gly Ser Thr Ala Pro Phe Arg Pro Phe Ala Leu 
365 370 375 

age ccg gee egg tta gat ctt ccc cct teg age ggt ccc cct tec age 3656 
Ser Pro Ala Arg Leu Asp Leu Pro Pro Ser Ser Gly Pro Pro Ser Ser 
380 385 390 

cct ace aat gee aat tgc cct egg cct eta gcg acg gtt gec cca tta 3704 
Pro Thr Asn Ala Asn Cys Pro Arg Pro Leu Ala Thr Val Ala Pro Leu 
395 400 405 410 

teg ggc acg gee ttc ttc cct gga aca act tag taacgcatcc tgtcctcaga 37 57 
Ser Gly Thr Ala Phe Phe Pro Gly Thr Thr 
415 420 
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aaagtcctta 


tattaaatca 


aataaaacct 

L* * W 23 23 23 w 


caaaatatct 


agacccagct 


ttcttgtaca 


3817 


aaataataat 


aacatcgata 


atcaacctct 


ggattacaaa 


atttgtgaaa 


aattaactaa 

23 ^* w w 23 23 23 


3877 


tattcttaac 

w L* L- V* L- \* 


ta tat terete 

LA \mr X-J W L* L^ W N— ' 


ettttacget 


atataaatac 


getgetttaa 


tgcctttgta 


3937 


tcatgctatt 


acttcccata 


tggctttcat 


tttctcctcc 


ttgtataaat 


cctggttgct 


3997 


gtctctttat 


aaaaaattat 


aacccattat 

53 53 ^ ^ 53 *— ^ 53 


caggcaacgt 


aacataatat 


gcactgtgtt 


4057 


tactaacaca 


acccccactg 


attaaaacat 


tgccaccacc 


tgtcagctcc 


tttcegggae 


4117 


tttcgctttc 


cccctcccta 


ttgccacggc 


ggaactcatc 


gccgcctgcc 


ttgcccgctg 


4177 


c t era acacrcrcr 


geteggctgt 


taaacactaa 


caattcegtg 


atattatcaa 


ggaaatcatc 


4237 


gtcctttcct 


tggctgeteg 


cctgtgttgc 


cacctggatt 


c tcrcacaaaa 


cgtccttctg 


4297 


ctacgtccct 


tcggccctca 


atccaacaaa 


ccttccttcc 


cacaacctac 


taccaactct 

w 23 ^—^ ^* 23 23 


4357 


gcggcctctt 


ccgcgtcttc 


gccttcgccc 


tcagacgagt 


cggatctccc 


tttaaaccac 

w w 23 23 23 23 


4417 


PtCCCCOCCt 

L^i l — L^r \w» V** V— * V--J V—" L* 


aatcaataaa 

LA L- L" V3 Mi UU.L4.Li 


ataaaagatt 


ttatttagtc 


tccagaaaaa 


aaaaaaaata 


4477 


aaaaacccca 


cctgtaggtt 


taacaaacta 


gcttaagtaa 


cgccattttg 


caaggcatgg 


4537 


aaaaatacat 


aactgagaat 


aaaaaaattc 


aaatcaaaat 

l* 23 ^* v^^*v*23 23 


caggaacaga 


tggaacagct 


4597 


aaatataaac 


caaacaggat 


atctgtggta 


agcagttcct 


accccaactc 


aaoaccaaaa 

L*y 23 23 L4L4^ Lt 


4657 


acagatggaa 


cagctgaata 


tgggecaaac 


aggatatctg 


taataaacaa 


ttcctgcccc 


4717 


croc t cacrcrcTC 


caagaacaga 


tggtccccag 


atgeggtcca 


gccctcagca 


at ttctaaaa 


4777 


aaccatcaoa 


tatttccaaa 

L.y L* l* L-v^v^uy LJ 


ataccccaaa 


aacctaaaat 

U.V \m* C VJLAUIL4 \m* 


aaccctatac 


ct tatt taaa 

*w Sm~ U LA W L- L- V_^J LA %A 


4837 


ctaaccaatc 


agttcgcttc 


tegcttctgt 


tcacaccrctt 


ctgctccccg 


agctcaataa 


4897 


aacfacrrccac 


aacccctcac 

L, L^ L* L^ LA \<# 


tcaaaacacc 


aat cctccaa 


t taactaaat 


cacccaaata 

\^ \£ v> \4 23 23 


4957 


cccatatat c 


caataaaccc 

WUU. L* LA LA LA V-r ^ 


tcttacaatt 


ocatccoact 


tataatctca 


c tat tec t ta 

L» L» L» L^ L- Lj» 


5017 


crcf aaaa t c t c 

23 23 27 23 2? v** L, 


ctctaaataa 


ttaactaccc 


y *-v,uy^yyyy 


atctttcatt 


taaaaactca 
^yyyyy^^-'-y 


5077 




ooaaacccct 


yuv^uuyyyuv 


caccaacrca 




y i»t*o.y ^> i»yy v< 


5137 




at t - 1* e*aat~ era 

y L. U L> 53 53 V- 53 


t*craraat*craa 

uy Li^y y uy liu 




a p a t* cr p a err* t 


ccc a ci a a Pica 

Vw* 53 5j c«. *w 53 


5197 


rTi~ r*^ rtr* t~ t* 




yydLyLLyyy 






L»y L.y L.v^ciy L.y 


ji. j / 


ggtgttggcg 


ggtgtcgggg 


cgcagccatg 


acccagtcac 


gtagegatag 


cggagtgtat 


5317 


actggcttaa 


etatgeggea 


tcagagcaga 


ttgtactgag 


agtgcaccat 


atgcggtgtg 


5377 


aaataccgea 


cagatgegta 


aggagaaaat 


accgcatcag 


gcgctcttcc 


gcttcctcgc 


5437 


tcactgactc 


gctgcgctcg 


gtcgttcggc 


tgeggegage 


ggtatcagct 


cactcaaagg 


5497 



cggtaatacg 


gttatccaca 


gaatcagggg 


ataacgcagg 


aaagaacatg 


tgagcaaaag 


5557 


gccagcaaaa 


ggccaggaac 


cgtaaaaagg 


ccgcgttgct 


ggcgtttttc 


cataggctcc 


5617 


gcccccctga 


cgagcatcac 


aaaaatcgac 


gctcaagtca 


gaggtggcga 


aacccgacag 


5677 


gactataaag 


ataccaggcg 


tttccccctg 


gaagctccct 


cgtgcgctct 


cctgttccga 


5737 


ccctgccgct 


taccggatac 


ctgtataaag 


ataccaggcg 


tttccccctg 


gaagctccct 


5797 


cgtgcgctct 


cctgttccga 


ccctgccgct 


taccggatac 


ctgctccaag 


ctgggctgtg 


5857 


tgcacgaacc 


ccccgttcag 


cccgaccgct 


gcgccttatc 


cggtaactat 


cgtcttgagt 


5917 


ccaacccggt 


aagacacgac 


ttatcgccac 


tggcagcagc 


cactggtaac 


aggattagca 


5977 


gagcgaggta 


tgtaggcggt 


gctacagagt 


tcttgaagtg 


gtggcctaac 


tacggctaca 


6037 


ctagaaggac 


agtatttggt 


atctgcgctc 


tgctgaagcc 


agttaccttc 


ggaaaaagag 


6097 


ttggtagctc 


ttgatccggc 


aaacaaacca 


ccgctggtag 


cggtggtttt 


tttgtttgca 


6157 


agcagcagat 


tacgcgcaga 


aaaaaaggat 


ctcaagaatc 


ctttgatctt 


tgatctacgg 


6217 


ggtctgacgc 


tcagtggaac 


gaaaactcac 


gttaagggat 


tttggtcatg 


agattatcaa 


6277 


aaaggatctt 


cacctagatc 


cttttaaatt 


aaaaatgaag 


ttttaaatca 


atctaaagta 


6337 


tatatgagta 


aacttggtct 


gacagttacc 


aatgcttaat 


cagtgaggca 


cctatctcag 


6397 


cgatctgtct 


atttcgttca 


tccatagttg 


cctgactccc 


cgtcgtgtag 


ataactacga 


6457 


tacgggaggg 


cttaccatct 


ggccccagtg 


ctgcaatgat 


accgcgagac 


ccacgctcac 


6517 


cggctccaga 


tttatcagca 


ataaaccagc 


cagccggaag 


ggccgagcgc 


agaagtggtc 


6577 


ctgcaacttt 


atccgcctcc 


atccagtcta 


ttaattgttg 


ccgggaagct 


agagtaagta 


6637 


gttcgccagt 


taatagtttg 


cgcaacgttg 


ttgccattgc 


tgcaggcatc 


gtggtgtcac 


6697 


gctcgtcgtt 


tggtatggct 


tcattcagct 


ccggttccca 


acgatcaagg 


cgagttacat 


6757 


gatcccccat 


gttgtgcaaa 


aaagcggtta 


gctccttcgg 


tcctccgatc 


gttgtcagaa 


6817 


gtaagttggc 


cgcagtgtta 


tcactcatgg 


ttatggcagc 


actgcataat 


tctcttactg 


6877 


tcatgccatc 


cgtaagatgc 


ttttctgtga 


ctggtgagta 


ctcaaccaag 


tcattctgag 


6937 


aatagtgtat 


gcggcgaccg 


agttgctctt 


gcccggcgtc 


aacacgggat 


aataccgcgc 


6997 


cacatagcag 


aactttaaaa 


gtgctcatca 


ttggaaaacg 


ttcttcgggg 


cgaaaactct 


7057 


caaggatctt 


accgctgttg 


agatccagtt 


cgatgtaacc 


cactcgtgca 


cccaactgat 


7117 


cttcagcatc 


ttttactttc 


accagcgttt 


ctgggtgagc 


aaaaacagga 


aggcaaaatg 


7177 



ccgcaaaaaa 


gggaataagg 


gcgacacgga 


aatgttgaat 


actcatactc 


ttcctttttc 


7237 


aatattattg 


aagcatttat 


cagggttatt 


gtctcatgag 


cggatacata 


tttgaatgta 


7297 


tttagaaaaa 


taaacaaata 


ggggttccgc 


gcacatttcc 


ccgaaaagtg 


ccacctgacg 


7357 


tctaagaaac 


cattattatc 


atgacattaa 


cctataaaaa 


taggcgtatc 


acgaggccct 


7417 


ttcgtcttca 


a 










7428 



<210> 16 

<211> 264 

<212> PRT 

<213> Artificial 



<220> 

<223> Synthetic Construct 
<400> 16 



Met lie Glu Gin Asp Gly Leu His Ala Gly Ser Pro Ala Ala Trp Val 
15 10 15 



Glu Arg Leu Phe Gly Tyr Asp Trp Ala Gin Gin Thr lie Gly Cys Ser 
20 25 30 



Asp Ala Ala Val Phe Arg Leu Ser Ala Gin Gly Arg Pro Val Leu Phe 
35 40 45 



Val Lys Thr Asp Leu Ser Gly Ala Leu Asn Glu Leu Gin Asp Glu Ala 
50 55 60 



Ala Arg Leu Ser Trp Leu Ala Thr Thr Gly Val Pro Cys Ala Ala Val 
65 70 75 80 



Leu Asp Val Val Thr Glu Ala Gly Arg Asp Trp Leu Leu Leu Gly Glu 
85 90 95 



Val Pro Gly Gin Asp Leu Leu Ser Ser His Leu Ala Pro Ala Glu Lys 
100 105 110 



Val Ser lie Met Ala Asp Ala Met Arg Arg Leu His Thr Leu Asp Pro 
115 120 125 



Ala Thr Cys Pro Phe Asp His Gin Ala Lys His Arg lie Glu Arg Ala 
130 135 140 
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Arg Thr Arg Met Glu Ala Gly Leu Val Asp Gin Asp Asp Leu Asp Glu 
145 ~ 150 155 160 



Glu His Gin Gly Leu Ala Pro Ala Glu Leu Phe Ala Arg Leu Lys Ala 
165 170 175 



Arg Met Pro Asp Gly Glu Asp Leu Val Val Thr His Gly Asp Ala Cys 
180 185 190 



Leu Pro Asn He Met Val Glu Asn Gly Arg Phe Ser Gly Phe He Asp 
195 200 205 



Cys Gly Arg Leu Gly Val Ala Asp Arg Tyr Gin Asp He Ala Leu Ala 
210 215 220 



Thr Arg Asp He Ala Glu Glu Leu Gly Gly Glu Trp Ala Asp Arg Phe 
225 230 235 240 



Leu Val Leu Tyr Gly He Ala Ala Pro Asp Ser Gin Arg He Ala Phe 
245 250 255 



Tyr Arg Leu Leu Asp Glu Phe Phe 
260 



<210> 


17 


<211> 


156 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic Construct 


<400> 


17 



Met Pro Lys Lys Arg Arg Ser Arg Arg Arg Pro Gin Pro He He Arg 
15 10 15 



Trp Gin Val Leu Leu Val Gly Gly Pro Thr Leu Tyr Met Pro Ala Arg 
20 25 30 



Pro Trp Phe Cys Pro Met Met Ser Pro Ser Met Pro Gly Ala Pro Ser 
35 40 45 



Ala Gly Pro Met Ser Asp Ser Asn Ser Lys Gly Ser Thr Pro Arg Ser 
50 55 60 
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Pro Ala Arg Pro Thr Val Ser Thr Gly Pro Pro Met Asp Asp Leu Ala 
65 70 75 80 



Ala Ser Met Glu Arg Cys Ser Leu Asp Cys Met Ser Pro Arg Pro Ala 
85 90 95 



Pro Lys Gly Pro Asp Asp Ser Gly Ser Thr Ala Pro Phe Arg Pro Phe 
100 105 110 



Ala Leu Ser Pro Ala Arg Leu Asp Leu Pro Pro Ser Ser Gly Pro Pro 
115 120 125 



Ser Ser Pro Thr Asn Ala Asn Cys Pro Arg Pro Leu Ala Thr Val Ala 
130 135 140 



Pro Leu Ser Gly Thr Ala Phe Phe Pro Gly Thr Thr 
145 150 155 



<210> 18 

<211> 7428 

<212> DNA 

<213> Artificial 

<220> 

<223> Engineered sequence form virus and plasmid 



<220> 

<221> CDS 

<222> (206) . . (1066) 

<223> Ampicillin resistance 

<400> 18 

ttgaagacga aagggcctcg tgatacgcct atttttatag gttaatgtca tgataataat 60 

ggtttcttag acgtcaggtg gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt 120 

atttttctaa atacattcaa atatgtatcc gctcatgaga caataaccct gataaatgct 180 

tcaataatat tgaaaaagga agagt atg agt att caa cat ttc cgt gtc gcc 232 

Met Ser lie Gin His Phe Arg Val Ala 
1 5 

ctt att ccc ttt ttt gcg gca ttt tgc ctt cct gtt ttt get cac cca 280 
Leu lie Pro Phe Phe Ala Ala Phe Cys Leu Pro Val Phe Ala His Pro 
10 15 20 25 

gaa acg ctg gtg aaa gta aaa gat get gaa gat cag ttg ggt gca cga 328 
Glu Thr Leu Val Lys Val Lys Asp Ala Glu Asp Gin Leu Gly Ala Arg 
30 35 40 



35 



gtg ggt tac ate gaa ctg gat etc aac age ggt aag ate ctt gag agt 376 
Val Gly Tyr lie Glu Leu Asp Leu Asn Ser Gly Lys lie Leu Glu Ser 
45 50 55 

ttt cgc ccc gaa gaa cgt ttt cca atg atg age act ttt aaa gtt ctg 424 
Phe Arg Pro Glu Glu Arg Phe Pro Met Met Ser Thr Phe Lys Val Leu 
60 65 70 

eta tgt ggc gcg gta tta tec cgt gtt gac gec ggg caa gag caa etc 472 
Leu Cys Gly Ala Val Leu Ser Arg Val Asp Ala Gly Gin Glu Gin Leu 
75 80 85 

ggt cgc cgc ata cac tat tct cag aat gac ttg gtt gag tac tea cca 520 
Gly Arg Arg lie His Tyr Ser Gin Asn Asp Leu Val Glu Tyr Ser Pro 
90 95 100 105 

gtc aca gaa aag cat ctt acg gat ggc atg aca gta aga gaa tta tgc 568 
Val Thr Glu Lys His Leu Thr Asp Gly Met Thr Val Arg Glu Leu Cys 
110 115 120 

agt get gee ata acc atg agt gat aac act gcg gee aac tta ctt ctg 616 
Ser Ala Ala lie Thr Met Ser Asp Asn Thr Ala Ala Asn Leu Leu Leu 
125 130 135 

aca acg ate gga gga ccg aag gag eta acc get ttt ttg cac aac atg 664 
Thr Thr lie Gly Gly Pro Lys Glu Leu Thr Ala Phe Leu His Asn Met 
140 145 150 

ggg gat cat gta act cgc ctt gat cgt tgg gaa ccg gag ctg aat gaa 712 
Gly Asp His Val Thr Arg Leu Asp Arg Trp Glu Pro Glu Leu Asn Glu 
155 160 165 

gec ata cca aac gac gag cgt gac acc acg atg cct gca gca atg gca 760 
Ala lie Pro Asn Asp Glu Arg Asp Thr Thr Met Pro Ala Ala Met Ala 
170 175 180 185 

aca acg ttg cgc aaa eta tta act ggc gaa eta ctt act eta get tec 808 
Thr Thr Leu Arg Lys Leu Leu Thr Gly Glu Leu Leu Thr Leu Ala Ser 
190 195 200 

egg caa caa tta ata gac tgg atg gag gcg gat aaa gtt gca gga cca 856 
Arg Gin Gin Leu lie Asp Trp Met Glu Ala Asp Lys Val Ala Gly Pro 
205 210 215 

ctt ctg cgc teg gec ctt ccg get ggc tgg ttt att get gat aaa tct 904 
Leu Leu Arg Ser Ala Leu Pro Ala Gly Trp Phe lie Ala Asp Lys Ser 
220 225 230 

gga gee ggt gag cgt ggg tct cgc ggt ate att gca gca ctg ggg cca 952 
Gly Ala Gly Glu Arg Gly Ser Arg Gly lie He Ala Ala Leu Gly Pro 
235 240 245 

gat ggt aag ccc tec cgt ate gta gtt ate tac acg acg ggg agt cag 1000 
Asp Gly Lys Pro Ser Arg He Val Val He Tyr Thr Thr Gly Ser Gin 
250 255 260 265 
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gca act atg gat gaa cga aat aga cag ate get gag ata ggt gee tea 1048 
Ala Thr Met Asp Glu Arg Asn Arg Gin lie Ala Glu lie Gly Ala Ser 
270 275 280 



ctg att aag cat tgg taa ctgtcagacc aagtttactc atatatactt 1096 
Leu lie Lys His Trp 
285 



tagattgatt 


taaaacttca 


tttttaattt 


aaaaggatct 


aggtgaagat 


cctttttgat 


1156 


aatctcatga 


ccaaaatccc 


ttaacgtgag 


ttttcgttcc 


actgagegtc 


agaccccgta 


1216 


gatcaaagat 


caaaggattc 


ttgagatcct 


ttttttctgc 


gcgtaatctg 


ctgcttgcaa 


1276 


acaaaaaaac 


caccgctacc 


aqcgqtggtt 

3 3 3 3 3 


tgtttgccgg 


atcaagagct 


accaactctt 


1336 


tttccgaagg 


taactggctt 


cagcagagcg 


cagataccaa 


atactgtcct 


tctagtgtag 


1396 


cccrtaattaq 


gccaccactt 


caagaactct 


gtagcaccgc 


ctacatacct 


cgctctgcta 


1456 


atcctgttac 


caataactcrc 


tgccagtggc 


gataagtcgt 


gtcttacegg 


gttggactca 


1516 


aaacaatacrt 


taceggataa 


qqcqcaqcqq 

3 3 3 3 3 3 


tegggctgaa 

ZJ ZJ ZJ ~~ 


eggggggtte 

ZJ ZJ ZJ ZJ ZJ ZJ 


gtgcacacag 


1576 


cccagcttgg 


agcaggtatc 


eggtaagegg 


cagggtegga 


acaggagagc 


gcacgaggga 


1636 


acttccaacra 


ggaaacgect 


ggtatcttta 


tacaggtatc 


eggtaagegg 

ZJ ZJ ZJ ZJ ZJ 


cagggtegga 


1696 


acaaoacraQc 


qcacqaqqqa 

3 3 3 3 3 


gcttccaggg 

3 3 3 3 


ggaaacgect 


ggtatcttta 


tagtcctgtc 


1756 


gqqtttcgcc 


acctctgact 


tgagegtega 


tttttgtgat 


getegtcagg 


ggggeggage 


1816 


ctatggaaaa 


acgccagcaa 


cgcggccttt 


ttacggttcc 


tggccttttg 


ctggcctttt 


1876 


cictcacatat 


tctttcctgc 


gttatcccct 


gattctgtgg 


ataacegtat 


taccgccttt 


1936 


aaatcracrcta 
ys*y i*y«y^ *-3 


ataccgctcg 


ccgcagccga 


acgaccgagc 


geagegagtc 


agtqagcgag 

3 3 3 3 3 


1996 




aacacctoat 


gcggtatttt 


ctccttacgc 


atctqtacqq 


tatttcacac 


2056 


r~> n pi 1 - t - n<"T t~ 
La La Lyy L. 






t rtaa t~ arra 


catagt taag 


ccagtataca 


2116 


ctccgctatc 


getaegtgae 


tgggtcatgg 


ctgcgccccg 


acacccgcca 


acacccgctg 


2176 


acgcgccctg 


aegggcttgt 


ctgctcccgg 


catccgctta 


cagacaagct 


gtgacegtet 


2236 


ccgggagctg 


catgtgtcag 


aggttttcac 


cgtcatcacc 


gaaacgcgcg 


aggcagecag 


2296 


cttacctccc 


ggtggtgggt 


cggtggtccc 


tgggcagggg 


tctcccgatc 


ccggacgagc 


2356 


ccccaaatga 


aagacccccg 


ctgacgggta 


gtcaatcact 


cagaggagac 


cctcccaagg 


2416 


aacagegaga 


ccacaagtcg 


gatgeaactg 


caagagggtt 


tattggatac 


acgggtaccc 


2476 


gggegactea 


gtcaatcgga 


ggactggcgc 


cccgagtgag 


gggttgtggg 


ctcttttatt 


2536 


gagctegggg 


ageagaageg 


cgegaacaga 


agegagaage 


gaactgattg 


gttagttcaa 


2596 
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ataaggcaca gggtcatttc aggtccttgg ggcaccctgg aaacatctga tggttctcta 2656 

gaaactgctg agggctggac cgcatctggg gaccatctgt tcttggccct gagccggggc 2716 

aggaactgct taccacagat atcctgtttg gcccatattc agctgttcca tctgttcttg 2776 

gccctgagcc ggggcaggaa ctgcttacca cagatatcct gtttggccca tattcagctg 2836 

ttccatctgt tcctgacctt gatctgaact tctctattct cagttatgta tttttccatg 2896 

ccttgcaaaa tggcgttact taagctagct tgccaaacct acaggtgggg tctttcattc 2956 

cccccttttt ctggagacta aataaaatct tttattttat cgatcaggcg gggaggcggc 3016 

ccaaagggag atccgactcg tctgagggcg aaggcgaaga cgcggaagag gccgcagagc 3076 

cggcagcagg ccgcgggaag gaaggtccgc tggattgagg gccgaaggga cgtagcagaa 3136 

ggacgtcccg cgcagaatcc aggtggcaac acaggcgagc agccaaggaa aggacgatga 3196 

tttccccgac aacaccacgg aattgtcagt gcccaacagc cgagcccctg tccagcagcg 3256 

ggcaaggcag gcggcgatga gttccgccgt ggcaataggg agggggaaag cgaaagtccc 3316 

ggaaaggagc tgacaggtgg tggcaatgcc ccaaccagtg ggggttgcgt cagcaaacac 3376 

agtgcacacc acgccacgtt gcctgacaac gggccacaac tcctcataaa gagacagcaa 3436 

ccaggattta tacaaggagg agaaaatgaa agccatacgg gaagcaatag catgatacaa 3496 

aggcattaaa gcagcgtatc cacatagcgt aaaaggagca acatagttaa gaataccagt 3556 

caatctttca caaattttgt aatccagagg ttgattatcg atgttatcac cactttgtac 3616 

aagaaagctg ggtctagata tctcgaggtc ccatttgatt taatataagg acttttctga 3676 

ggacaggatg cgttactaag ttgttccagg gaagaaggcc gtgcccgata atggggcaac 3736 

cgtcgctaga ggccgagggc aattggcatt ggtagggctg gaagggggac cgctcgaagg 3796 

gggaagatct aaccgggccg ggctgagcgc gaacggccgg aagggggcag ttgatccaga 3856 

gtcgtcgggg cccttggggg cgggtctcgg agacatgcag tcgagggaac aacgttccat 3916 

tgaggccgca aggtcgtcca tcggcggtcc agttgatacg gtgggtctcg caggtgagcg 3976 

tggtgtcgat cctttcgaat tggagtcgct catggggccc gcagaggggg caccaggcat 403 6 

cgatggtgac atcattggac aaaaccaggg ccgggcaggc atgtagagag tggggccccc 4096 

aaccaacaac acttgccatc tgatgatcgg ttgtgggcgt cttcgggacc gtcgtttttt 4156 

aggcatgtgg atcgaattcg gtaccggatc cagtcgacgt taagtatttc cttagagaat 4216 

tggttctatc tccttcgaag cctgcttttt tgtacaaact tgtgataaca agcttctcga 4276 
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ggcaggaccg 


agagctgagc 


acgagggtct 


caggagaaga 


acgcttctgg 


tgccgctaac 


4336 


tcgacagggc 


cggcatttta 


ttaatttatc 


agcaggtgag 


gtcagcagct 


cagggtgtgg 


4396 


atttttcggg 


aaaggggaag 


ttggggaggt 


acggggattc 


tagccaccag 


ctgccgtcac 


4456 


cagcttttct 


ggcagctgac 


gtctctgtct 


ggtttacggg 


gcggtggcgg 


cgcctaggtc 


4516 


ggcatgatct 


ttcatacaaa 


tgattccccc 


tcaaaaaagg 


cgggagagcc 


attcattttc 


4576 


tagcagttgt 


aattttaagg 


gggaatcaag 


tcttattttt 


ctgctaggat 


ccgcctcgac 


4636 


caaagcggcc 


atcgtgcctc 


cccactcctg 


cagttcgggg 


gcatggatgc 


gcggatagcc 


4696 


gctgctggtt 


tcctggatgc 


cgacggattt 


gcactgccgg 


tagaactccg 


cgaggtcgtc 


4756 


cagcctcagg 


cagcagctga 


accaactcgc 


gaggggatcg 


agcccggggt 


gggcgaagaa 


4816 


ctccagcatg 


agatccccgc 


gctggaggat 


catccagccg 


gcgtcccgga 


aaacgattcc 


4876 


gaagcccaac 


ctttcataga 


aggcggcggt 


ggaatcgaaa 


tctcgtgatg 


gcaggttggg 


4936 


cgtcgcttgg 


tcggtcattt 


cgaaccccag 


agtcccgctc 


agaagaactc 


gtcaagaagg 


4996 


cgatagaagg 


cgatgcgctg 


cgaatcggga 


gcggcgatac 


cgtaaagcac 


gaggaagcgg 


5056 


tcagcccatt 


cgccgccaag 


ctcttcagca 


atatcacggg 


tagccaacgc 


tatgtcctga 


5116 


tagcggtccg 


ccacacccag 


ccggccacag 


tcgatgaatc 


cagaaaagcg 


gccattttcc 


5176 


accatgatat 


tcggcaagca 


ggcatcgcca 


tgggtcacga 


cgagatcctc 


gccgtcgggc 


5236 


atgcgcgcct 


tgagcctggc 


gaacagttcg 


gctggcgcga 


gcccctgatg 


ctcttcgtcc 


5296 


agatcatcct 


gatcgacaag 


accggcttcc 


atccgagtac 


gtgctcgctc 


gatgcgatgt 


5356 


ttcgcttggt 


ggtcgaatgg 


gcaggtagcc 


ggatcaagcg 


tatgcagccg 


ccgcattgca 


5416 


tcagccatga 


tggatacttt 


ctcggcagga 


gcaaggtgag 


atgacaggag 


atcctgcccc 


5476 


ggcacttcgc 


ccaatagcag 


ccagtccctt 


cccgcttcag 


tgacaacgtc 


gagcacagct 


5536 


gcgcaaggaa 


cgcccgtcgt 


ggccagccac 


gatagccgcg 


ctgcctcgtc 


ctgcagttca 


5596 


ttcagggcac 


cggacaggtc 


ggtcttgaca 


aaaagaaccg 


ggcgcccctg 


cgctgacagc 


5656 


cggaacacgg 


cggcatcaga 


gcagccgatt 


gtctgttgtg 


cccagtcata 


gccgaatagc 


5716 


ctctccaccc 


aaacQQCcgg 


agaacctgcg 


tgcaatccat 


cttgttcaat 


catgcgaaac 


5776 


gatcctcatc 


ctgtctcttg 


atcagatcgg 


aattccggcg 


cctagagaag 


gagtgagggc 


5836 


tggataaagg 


gaggatcgag 


gcggggtcga 


acgaggaggt 


tcaaggggga 


gagacggggc 


5896 


ggatggagga 


agaggaggcg 


gaggcttagg 


gtgtacaaag 


ggcttgaccc 


agggaggggg 


5956 


gtcaaaagcc 


aaggcttccc 


aggtcacgat 


gtaggggacc 


tggtctgggt 


gtccatgcgg 


6016 



gccaggtgaa 


aagaccttga 


tcttaacctg 


ggtgatgagg 


tctcggttaa 


aggtgccgtc 


6076 


tcgcggccat 


ccgacgttaa 


aggttggcca 


ttctgcagag 


cagaaggtaa 


cccaacgtct 


6136 


cttcttgaca 


tctaccgact 


ggttgtgagc 


gatccgctcg 


acatctttcc 


agtgacctaa 


6196 


ggtcaaactt 


aagggagtgg 


taacagtctg 


gccctaattt 


tcagacaaat 


acagaaacac 


6256 


agtcagacag 


agacaacaca 


gaacgatgct 


gcagcgctgc 


agcagacaag 


acgcgcggct 


6316 


tcggttccaa 


accgaaagca 


aaaattcaga 


cggaggcggg 


aactgtttta 


ggttctcgtc 


6376 


tcctaccaga 


accacatatc 


ctgacggggt 


cggattccac 


atcgactccc 


ttcctcaggt 


6436 


cgggccacaa 


aaacggcccc 


caaagtccct 


gggacgtctc 


ccagggttgc 


ggccgggtgt 


6496 


tcagaactcg 


tcagttccac 


cacgggtccg 


ccagatacag agctagttag 


ctaactagta 


6556 


cagacgcagg 


cgcataacat 


caaacataga 


cactagacaa 


tcggacagac 


acagataagt 


6616 


tgctggccag 


cttacctccc 


ggtggtgggt 


cggtggtccc 


tgggcagggg 


tctccaaatc 


6676 


ccggacgagc 


ccccaaatga 


aagacccccg 


tcgtgggtag 


tcaatcactc 


agaggagacc 


6736 


ctcccaagga 


acagcgagac 


cacgattcgg 


atgcaaacag 


caagaggctt 


tattgggaat 


6796 


acgggtaccc 


gggcgacgca 


gtctatcgga 


agactggcgc 


gccgagtgag 


gggttgtggg 


6856 


ctcttttatt 


gagctcggag 


agcggaagcg 


cgcgaacaga 


agcgagaagc 


gaactgattg 


6916 


gttagttcaa 


ataaggtaca 


gggtcatttt 


caggtccttg gggcaccctg gaaacatctg 


6976 


atgattcact 


agaaactgct 


gagggctgga 


ccgcatctgg 


ggaccatctg 


ttcttggccc 


7036 


cgagccgggg 


caggaactgc 


ttaccacaga 


tatcctgttt 


ggcccatcac 


tcagctgtct 


7096 


catctgttct 


tggccctgag 


ccggggcagg 


aaccgcttac 


cacagatatc 


ctgtttggta 


7156 


ttcagctgtt 


tctttgttcc 


tgaccttgat 


ctgaactttt 


ctattctcag 


ttatgtattt 


7216 


ttccatgcct 


tgcaaagtgg 


cgttacttaa 


gctagcttgc 


cacctacggg 


tggggtcttt 


7276 


caaaagcaaa 


gaaacggaag 


ggccgcggct 


ttggctccgg 


tgagtgtggg 


gaatagggta 


7336 


gagtggtcta 


agaggcagaa 


gcccgcgaat 


ttgggagtgt 


gagggggaca 


catttggagg 


7396 


acagttttcg 


gtgatctggt 


atgaattaat 


tc 






7428 



<210> 19 

<211> 286 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 
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<400> 19 



Met Ser lie Gin His Phe Arg Val Ala Leu lie Pro Phe Phe Ala Ala 
15 10 15 



Phe Cys Leu Pro Val Phe Ala His Pro Glu Thr Leu Val Lys Val Lys 
20 25 30 



Asp Ala Glu Asp Gin Leu Gly Ala Arg Val Gly Tyr lie Glu Leu Asp 
35 40 45 



Leu Asn Ser Gly Lys lie Leu Glu Ser Phe Arg Pro Glu Glu Arg Phe 
50 55 60 



Pro Met Met Ser Thr Phe Lys Val Leu Leu Cys Gly Ala Val Leu Ser 
65 70 75 80 



Arg Val Asp Ala Gly Gin Glu Gin Leu Gly Arg Arg lie His Tyr Ser 
85 90 95 



Gin Asn Asp Leu Val Glu Tyr Ser Pro Val Thr Glu Lys His Leu Thr 
100 105 110 



Asp Gly Met Thr Val Arg Glu Leu Cys Ser Ala Ala lie Thr Met Ser 
115 120 125 



Asp Asn Thr Ala Ala Asn Leu Leu Leu Thr Thr lie Gly Gly Pro Lys 
130 135 140 



Glu Leu Thr Ala Phe Leu His Asn Met Gly Asp His Val Thr Arg Leu 
145 150 155 160 



Asp Arg Trp Glu Pro Glu Leu Asn Glu Ala lie Pro Asn Asp Glu Arg 
165 170 175 



Asp Thr Thr Met Pro Ala Ala Met Ala Thr Thr Leu Arg Lys Leu Leu 
180 185 190 



Thr Gly Glu Leu Leu Thr Leu Ala Ser Arg Gin Gin Leu lie Asp Trp 
195 200 205 



Met Glu Ala Asp Lys Val Ala Gly Pro Leu Leu Arg Ser Ala Leu Pro 
210 215 220 
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Ala Gly Trp Phe He Ala Asp Lys Ser Gly Ala Gly Glu Arg Gly Ser 
225 230 235 240 



Arg Gly He He Ala Ala Leu Gly Pro Asp Gly Lys Pro Ser Arg He 
245 250 255 



Val Val He Tyr Thr Thr Gly Ser Gin Ala Thr Met Asp Glu Arg Asn 
260 265 270 



Arg Gin He Ala Glu He Gly Ala Ser Leu He Lys His Trp 
275 280 285 



<210> 20 

<211> 7010 

<212> DNA 

<213> Artificial 

<220> 

<223> Engineered sequence from virus and plasmid 



<220> 

<221> LTR 

<222> (149) . . (737) 

<223> 5' MoMuSVLTR 

<220> 

<221> CDS 

<222> (1660) . . (2454) 

<223> Neomycin resistance 

<220> 

<221> promoter 

<222> (2806) . . (3150) 

<223> BLV promoter 

<220> 

<221> misc_f eature 

<222> (3170) . . (3194) 

<223> attBl 

<220> 

<221> CDS 

<222> (3236) . . (3955) 

<223> EYFP; enhanced yellow fluorescent protein 
<220> 

< 2 2 1 > mis c_f ea tur e 

<222> (3980) . . (4004) 

<223> attB2 
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<220> 

<221> LTR 

<222> (4056) . . (4649) 

<223> 3' MoMuSVLTR 

<400> 20 



gaattaattc 


ataccagatc 


accgaaaact 


gtcctccaaa 


tgtgtccccc 


tcacactccc 


60 


aaattcgcgg 


gcttctgcct 


cttagaccac 


tctaccctat 


tccccacact 


caccggagcc 


120 


aaagccgcgg 


cccttccgtt 


tctttgcttt 


tgaaagaccc 


cacccgtagg 


tggcaagcta 


180 


gcttaagtaa 


cgccactttg 


caaggcatgg 


aaaaatacat 


aactgagaat 


agaaaagttc 


240 


agatcaaggt 


caggaacaaa 


gaaacagctg 


aataccaaac 


aggatatctg 


tggtaagcgg 


300 


ttcctgcccc 


ggctcagggc 


caagaacaga 


tgagacagct 


gagtgatggg 


ccaaacagga 


360 


tatctgtggt 


aagcagttcc 


tgccccggct 


cggggccaag 


aacagatggt 


ccccagatgc 


420 


ggtccagccc 


tcagcagttt 


ctagtgaatc 


atcagatgtt 


tccagggtgc 


cccaaggacc 


480 


tgaaaatgac 


cctgtacctt 


atttgaacta 


accaatcagt 


tcgcttctcg 


cttctgttcg 


540 


cgcgcttccg 


ctctccgagc 


tcaataaaag 


agcccacaac 


ccctcactcg 


gcgcgccagt 


600 


cttccgatag 


actgcgtcgc 


ccgggtaccc 


gtattcccaa 


taaagcctct 


tgctgtttgc 


660 


atccgaatcg 


tggtctcgct 


gttccttggg 


agggtctcct 


ctgagtgatt 


gactacccac 


720 


gacgggggtc 


tttcatttgg 


gggctcgtcc 


gggatttgga 


gacccctgcc 


cagggaccac 


780 


cgacccacca 


ccgggaggta 


agctggccag 


caacttatct 


gtgtctgtcc 


gattgtctag 


840 


tgtctatgtt 


tgatgttatg 


cgcctgcgtc 


tgtactagtt 


agctaactag 


ctctgtatct 


900 


aqcqqacccq 


tggtggaact 


gacgagttct 


gaacacccgg 


ccgcaaccct 


gggagacgtc 


960 


ccaaaaactt 


taagggccgt 


ttttgtggcc 


cgacctgagg 


aagggagtcg 


atgtggaatc 


1020 


cgaccccgtc 


aggatatgtg 


attctggtag 


gagacgagaa 


cctaaaacag 


ttcccgcctc 


1080 


cert" pt* craat t 


tttactttca 


atttaaaacc 


gaagccgcgc 


gtcttgtctg 


ctgcagcgct 


1140 


gcagcatcgt 


tctgtgttgt 


ctctgtctga 


ctgtgtttct 


gtatttgtct 


gaaaattagg 


1200 


gccagactgt 


taccactccc 


ttaagtttga 


ccttaggtca 


ctggaaagat 


gtcgagcgga 


1260 


tcgctcacaa 


ccagtcggta 


gatgtcaaga 


agagacgttg 


ggttaccttc 


tgctctgcag 


1320 


aatggccaac 


ctttaacgtc 


ggatggccgc 


gagacggcac 


ctttaaccga 


gacctcatca 


1380 


cccaggttaa 


gatcaaggtc 


ttttcacctg 


gcccgcatgg 


acacccagac 


caggtcccct 


1440 


acatcgtgac 


ctgggaagcc 


ttggcttttg 


acccccctcc 


ctgggtcaag 


ccctttgtac 


1500 


accctaagcc 


tccgcctcct 


cttcctccat 


ccgccccgtc 


tctccccctt 


gaacctcctc 


1560 
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gttcgacccc gcctcgatcc tccctttatc cagccctcac tccttctcta ggcgccggaa 1620 



ttccgatctg atcaagagac aggatgagga tcgtttcgc atg att gaa caa gat 1674 

Met He Glu Gin Asp 
1 5 

gga ttg cac gca ggt tct ccg gcc get tgg gtg gag agg eta ttc ggc 1722 
Gly Leu His Ala Gly Ser Pro Ala Ala Trp Val Glu Arg Leu Phe Gly 
10 15 20 

tat gac tgg gca caa cag aca ate ggc tgc tct gat gcc gcc gtg ttc 1770 
Tyr Asp Trp Ala Gin Gin Thr He Gly Cys Ser Asp Ala Ala Val Phe 
25 30 35 

egg ctg tea gcg cag ggg cgc ccg gtt ctt ttt gtc aag acc gac ctg 1818 
Arg Leu Ser Ala Gin Gly Arg Pro Val Leu Phe Val Lys Thr Asp Leu 
40 45 50 

tec ggt gcc ctg aat gaa ctg cag gac gag gca gcg egg eta teg tgg 1866 
Ser Gly Ala Leu Asn Glu Leu Gin Asp Glu Ala Ala Arg Leu Ser Trp 
55 60 65 

ctg gcc acg acg ggc gtt cct tgc gca get gtg etc gac gtt gtc act 1914 
Leu Ala Thr Thr Gly Val Pro Cys Ala Ala Val Leu Asp Val Val Thr 
70 75 80 85 

gaa gcg gga agg gac tgg ctg eta ttg ggc gaa gtg ccg ggg cag gat 1962 
Glu Ala Gly Arg Asp Trp Leu Leu Leu Gly Glu Val Pro Gly Gin Asp 
90 95 100 

etc ctg tea tct cac ctt get cct gcc gag aaa gta tec ate atg get 2010 
Leu Leu Ser Ser His Leu Ala Pro Ala Glu Lys Val Ser He Met Ala 
105 110 115 

gat gca atg egg egg ctg cat acg ctt gat ccg get acc tgc cca ttc 2058 
Asp Ala Met Arg Arg Leu His Thr Leu Asp Pro Ala Thr Cys Pro Phe 
120 125 130 

gac cac caa gcg aaa cat cgc ate gag cga gca cgt act egg atg gaa 2106 
Asp His Gin Ala Lys His Arg He Glu Arg Ala Arg Thr Arg Met Glu 
135 140 145 

gcc ggt ctt gtc gat cag gat gat ctg gac gaa gag cat cag ggg etc 2154 
Ala Gly Leu Val Asp Gin Asp Asp Leu Asp Glu Glu His Gin Gly Leu 
150 155 160 165 

gcg cca gcc gaa ctg ttc gcc agg etc aag gcg cgc atg ccc gac ggc 2202 
Ala Pro Ala Glu Leu Phe Ala Arg Leu Lys Ala Arg Met Pro Asp Gly 
170 175 180 

gag gat etc gtc gtg acc cat ggc gat gcc tgc ttg ccg aat ate atg 2250 
Glu Asp Leu Val Val Thr His Gly Asp Ala Cys Leu Pro Asn He Met 
185 190 195 

gtg gaa aat ggc cgc ttt tct gga ttc ate gac tgt ggc egg ctg ggt 2298 
Val Glu Asn Gly Arg Phe Ser Gly Phe He Asp Cys Gly Arg Leu Gly 
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200 



205 



210 



gtg gcg gac cgc tat cag gac ata gcg ttg get acc cgt gat att get 2346 
Val Ala Asp Arg Tyr Gin Asp lie Ala Leu Ala Thr Arg Asp lie Ala 
215 220 225 

gaa gag ctt ggc ggc gaa tgg get gac cgc ttc etc gtg ctt tac ggt 2394 
Glu Glu Leu Gly Gly Glu Trp Ala Asp Arg Phe Leu Val Leu Tyr Gly 
230 235 240 245 

ate gee get ccc gat teg cag cgc ate gee ttc tat cgc ctt ctt gac 2442 
lie Ala Ala Pro Asp Ser Gin Arg lie Ala Phe Tyr Arg Leu Leu Asp 
250 255 260 

gag ttc ttc tga gegggactet ggggttcgaa atgaccgacc aagcgacgcc 2494 
Glu Phe Phe 



caacctgcca tcacgagatt tcgattccac cgccgccttc tatgaaaggt tgggcttegg 2554 

aatcgttttc cgggacgccg gctggatgat cctccagcgc ggggatctca tgctggagtt 2614 

cttcgcccac cccgggctcg atcccctcgc gagttggttc agctgctgcc tgaggctgga 2674 

cgacctcgcg gagttctacc ggcagtgcaa atccgtcggc atccaggaaa ccagcagcgg 2734 

ctatccgcgc atccatgccc ccgaactgca ggagtgggga ggcacgatgg ccgctttggt 2794 

egaggeggat cctagcagaa aaataagact tgattccccc ttaaaattac aactgetaga 2854 

aaatgaatgg ctctcccgcc ttttttgagg gggaatcatt tgtatgaaag ateatgeega 2914 

cctaggcgcc gccaccgccc cgtaaaccag acagagacgt cagctgccag aaaagctggt 2974 

gaeggcaget ggtggctaga atccccgtac ctccccaact tcccctttcc cgaaaaatcc 3034 

acaccctgag ctgctgacct cacctgctga taaattaata aaatgeegge cctgtcgagt 3094 

tagcggcacc agaagegtte ttctcctgag accctcgtgc tcagctctcg gtcctgcctc 3154 

gagaagcttg ttatcacaag tttgtacaaa aaagcaggct tcgaaggaga tagaaccaat 3214 

tctctaagga aatacttaac c atg gtg age aag ggc gag gag ctg ttc acc 3265 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr 
265 270 

ggg gtg gtg ccc ate ctg gtc gag ctg gac ggc gac gta aac ggc cac 3313 
Gly Val Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His 
275 280 285 290 

aag ttc age gtg tec ggc gag ggc gag ggc gat gec acc tac ggc aag 3361 
Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys 
295 300 305 

ctg acc ctg aag ttc ate tgc acc acc ggc aag ctg ccc gtg ccc tgg 3409 
Leu Thr Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp 
310 315 320 
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ccc acc etc gtg acc acc ttc ggc tac ggc ctg cag tgc ttc gec cgc 
Pro Thr Leu Val Thr Thr Phe Gly Tyr Gly Leu Gin Cys Phe Ala Arg 
325 330 335 



3457 



tac ccc gac cac atg aag cag cac gac ttc ttc aag tec gec atg ccc 3505 
Tyr Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro 
340 345 350 

gaa ggc tac gtc cag gag cgc acc ate ttc ttc aag gac gac ggc aac 3553 
Glu Gly Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn 
355 360 365 370 

tac aag acc cgc gee gag gtg aag ttc gag ggc gac acc ctg gtg aac 3601 
Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn 
375 380 385 

cgc ate gag ctg aag ggc ate gac ttc aag gag gac ggc aac ate ctg 3649 
Arg lie Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu 
390 395 400 

ggg cac aag ctg gag tac aac tac aac age cac aac gtc tat ate atg 3697 
Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met 
405 410 415 

gec gac aag cag aag aac ggc ate aag gtg aac ttc aag ate cgc cac 3745 
Ala Asp Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His 
420 425 430 

aac ate gag gac ggc age gtg cag etc gee gac cac tac cag cag aac 3793 
Asn lie Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn 
435 440 445 450 

acc ccc ate ggc gac ggc ccc gtg ctg ctg ccc gac aac cac tac ctg 3841 
Thr Pro lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu 
455 460 465 

age tac cag tec gee ctg age aaa gac ccc aac gag aag cgc gat cac 3889 
Ser Tyr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His 
470 475 480 

atg gtc ctg ctg gag ttc gtg acc gee gee ggg ate act etc ggc atg 3937 
Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met 
485 490 495 

gac gag ctg tac aag taa agcggccgca ctcgagatat ctagacccag 3985 
Asp Glu Leu Tyr Lys 
500 

ctttcttgta caaagtggtg ataacatcga taaaataaaa gattttattt agtctccaga 4045 

aaaagggggg aatgaaagac cccacctgta ggtttggcaa gctagcttaa gtaacgecat 4105 

tttgeaagge atggaaaaat acataactga gaatagagaa gttcagatca aggtcaggaa 4165 

cagatggaac agctgaatat gggecaaaca ggatatctgt ggtaagcagt tcctgccccg 4225 
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gctcagggcc 


aagaacagat 


ggaacagctg 


aatatgggcc 


aaacaggata 


tctgtggtaa 


4285 


gcagttcctg 


ccccggctca 


gggccaagaa 


cagatggtcc 


ccagatgcgg 


tccagccctc 


4345 


agcagtttct 


agagaaccat 


cagatgtttc 


cagggtgccc 


caaggacctg 


aaatgaccct 


4405 


gtgccttatt 


tgaactaacc 


aatcagttcg 


cttctcgctt 


ctgttcgcgc 


gcttctgctc 


4465 


cccgagctca 


ataaaagagc 


ccacaacccc 


tcactcgggg 


cgccagtcct 


ccgattgact 


4525 


gagtcgcccg 


ggtacccgtg 


tatccaataa 


accctcttgc 


agttgcatcc 


gacttgtggt 


4585 


ctcgctgttc 


cttgggaggg 


tctcctctga 


gtgattgact 


acccgtcagc 


gggggtcttt 


4645 


catttggggg 


ctcgtccggg 


atcgggagac 


ccctgcccag 


ggaccaccga 


cccaccaccg 


4705 


ggaggtaagc 


tggctgcctc 


gcgcgtttcg 


gtgatgacgg 


tgaaaacctc 


tgacacatgc 


4765 


agctcccgga 


gacggtcaca 


gcttgtctgt 


aagcggatgc 


cgggagcaga 


caagcccgtc 


4825 


agggcgcgtc 


agcgggtgtt 


ggcgggtgtc 


ggggcgcagc 


catgacccag 


tcacgtagcg 


4885 


atagcggagt 


gtatactggc 


ttaactatgc 


ggcatcagag 


cagattgtac 


tgagagtgca 


4945 


ccatatgcgg 


tgtgaaatac 


cgcacagatg 


cgtaaggaga 


aaataccgca 


tcaggcgctc 


5005 


ttccgcttcc 


tcgctcactg 


actcgctgcg 


ctcggtcgtt 


cggctgcggc 


gagcggtatc 


5065 


agctcactca 


aaggcggtaa 


tacggttatc 


cacagaatca 


ggggataacg 


caggaaagaa 


5125 


catgtgagca 


aaaggccagc 


aaaaggccag 


gaaccgtaaa 


aaggccgcgt 


tgctggcgtt 


5185 


tttccatagg 


ctccgccccc 


ctgacgagca 


tcacaaaaat 


cgacgctcaa 


gtcagaggtg 


5245 


gcgaaacccg 


acaggactat 


aaagatacca 


ggcgtttccc 


cctggaagct 


ccctcgtgcg 


5305 


ctctcctgtt 


ccgaccctgc 


cgcttaccgg 


atacctgtcc 


gcctttctcc 


cttcgggaag 


5365 


cgtggcgctt 


tctcatagct 


cacgctgtag 


gtatctcagt 


tcggtgtagg 


tcgttcgctc 


5425 


caagctgggc 


tgtgtgcacg 


aaccccccgt 


tcagcccgac 


cgctgcgcct 


tatccggtaa 


5485 


ctatcgtctt 


gagtccaacc 


cggtaagaca 


cgacttatcg 


ccactggcag 


cagccactgg 


5545 


taacaggatt 


agcagagcga 


ggtatgtagg 


cggtgctaca 


gagttcttga 


agtggtggcc 


5605 


taactacggc 


tacactagaa 


ggacagtatt 


tggtatctgc 


gctctgctga 


agccagttac 


5665 


cttcggaaaa 


agagttggta 


gctcttgatc 


cggcaaacaa 


accaccgctg 


gtagcggtgg 


5725 


tttttttgtt 


tgcaagcagc 


agattacgcg 


cagaaaaaaa 


ggatctcaag 


aagatccttt 




gatcttttct 


acggggtctg 


acgctcagtg 


gaacgaaaac 


tcacgttaag 


ggattttggt 


5845 


catgagatta 


tcaaaaagga 


tcttcaccta 


gatcctttta 


aattaaaaat 


gaagttttaa 


5905 


atcaatctaa 


agtatatatg 


agtaaacttg 


gtctgacagt 


taccaatgct 


taatcagtga 


5965 



ggcacctatc 


tcagcgatct 


gtctatttcg 


ttcatccata 


gttgcctgac 


tccccgtcgt 


6025 


gtagataact 


acgatacggg 


agggcttacc 


atctggcccc 


agtgctgcaa 


tgataccgcg 


6085 


agacccacgc 


tcaccggctc 


cagatttatc 


agcaataaac 


cagccagccg 


gaagggccga 


6145 


gcgcagaagt 


ggtcctgcaa 


ctttatccgc 


ctccatccag 


tctattaatt 


gttgccggga 


6205 


agctagagta 


agtagttcgc 


cagttaatag 


tttgcgcaac 


gttgttgcca 


ttgctgcagg 


6265 


catcgtggtg 


tcacgctcgt 


cgtttggtat 


ggcttcattc 


agctccggtt 


cccaacgatc 


6325 


aaggcgagtt 


acatgatccc 


ccatgttgtg 


caaaaaagcg 


gttagctcct 


tcggtcctcc 


6385 


gatcgttgtc 


agaagtaagt 


tggccgcagt 


gttatcactc 


atggttatgg 


cagcactgca 


6445 


taattctctt 


actgtcatgc 


catccgtaag 


atgcttttct 


gtgactggtg 


agtactcaac 


6505 


caagtcattc 


tgagaatagt 


gtatgcggcg 


accgagttgc 


tcttgcccgg 


cgtcaacacg 


6565 


ggataatacc 


gcgccacata 


gcagaacttt 


aaaagtgctc 


atcattggaa 


aacgttcttc 


6625 


ggggcgaaaa 


ctctcaagga 


tcttaccgct 


gttgagatcc 


agttcgatgt 


aacccactcg 


6685 


tgcacccaac 


tgatcttcag 


catcttttac 


tttcaccagc 


gtttctgggt 


gagcaaaaac 


6745 


aggaaggcaa 


aatgccgcaa 


aaaagggaat 


aagggcgaca 


cggaaatgtt 


gaatactcat 


6805 


actcttcctt 


tttcaatatt 


attgaagcat 


ttatcagggt 


tattgtctca 


tgagcggata 


6865 


catatttgaa 


tgtatttaga 


aaaataaaca 


aataggggtt 


ccgcgcacat 


ttccccgaaa 


6925 


agtgccacct 


gacgtctaag 


aaaccattat 


tatcatgaca 


ttaacctata 


aaaataggcg 


6985 


tatcacgagg 


ccctttcgtc 


ttcaa 
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<210> 21 

<211> 264 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 21 

Met lie Glu Gin Asp Gly Leu His Ala Gly Ser Pro Ala Ala Trp Val 
15 10 15 

Glu Arg Leu Phe Gly Tyr Asp Trp Ala Gin Gin Thr lie Gly Cys Ser 
20 25 30 

Asp Ala Ala Val Phe Arg Leu Ser Ala Gin Gly Arg Pro Val Leu Phe 

48 



35 



40 



45 



Val Lys Thr Asp Leu Ser Gly Ala Leu Asn Glu Leu Gin Asp Glu Ala 
50 55 60 



Ala Arg Leu Ser Trp Leu Ala Thr Thr Gly Val Pro Cys Ala Ala Val 
65 ~ 70 75 80 



Leu Asp Val Val Thr Glu Ala Gly Arg Asp Trp Leu Leu Leu Gly Glu 
85 90 95 



Val Pro Gly Gin Asp Leu Leu Ser Ser His Leu Ala Pro Ala Glu Lys 
100 105 110 



Val Ser He Met Ala Asp Ala Met Arg Arg Leu His Thr Leu Asp Pro 
115 120 125 



Ala Thr Cys Pro Phe Asp His Gin Ala Lys His Arg He Glu Arg Ala 
130 135 140 



Arg Thr Arg Met Glu Ala Gly Leu Val Asp Gin Asp Asp Leu Asp Glu 
145 150 155 160 



Glu His Gin Gly Leu Ala Pro Ala Glu Leu Phe Ala Arg Leu Lys Ala 
165 170 175 



Arg Met Pro Asp Gly Glu Asp Leu Val Val Thr His Gly Asp Ala Cys 
180 185 190 



Leu Pro Asn He Met Val Glu Asn Gly Arg Phe Ser Gly Phe He Asp 
195 200 205 



Cys Gly Arg Leu Gly Val Ala Asp Arg Tyr Gin Asp He Ala Leu Ala 
210 215 220 



Thr Arg Asp He Ala Glu Glu Leu Gly Gly Glu Trp Ala Asp Arg Phe 
225 230 235 240 



Leu Val Leu Tyr Gly He Ala Ala Pro Asp Ser Gin Arg He Ala Phe 
245 250 255 



Tyr Arg Leu Leu Asp Glu Phe Phe 
2 60 
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<210> 22 

<211> 239 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 22 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
15 10 15 



Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 



Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 



Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 



Phe Gly Tyr Gly Leu Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 
65 70 75 80 



Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 



Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 



Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 



lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 



Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 



Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser 
165 170 175 



Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 
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180 



185 



Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu 
195 200 205 



Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 



Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225 230 235 



<210> 23 

<211> 7010 

<212> DNA 

<213> Artificial 

<220> 

<223> Engineered sequence form virus and plasmid 



<220> 

<221> CDS 

<222> (206) . . (1066) 

<223> Ampicillin resistance 

<400> 23 

ttgaagacga aagggcctcg tgatacgcct atttttatag gttaatgtca tgataataat 60 

ggtttcttag acgtcaggtg gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt 120 

atttttctaa atacattcaa atatgtatcc gctcatgaga caataaccct gataaatgct 180 

tcaataatat tgaaaaagga agagt atg agt att caa cat ttc cgt gtc gcc 232 

Met Ser lie Gin His Phe Arg Val Ala 
1 5 

ctt att ccc ttt ttt gcg gca ttt tgc ctt cct gtt ttt get cac cca 280 
Leu lie Pro Phe Phe Ala Ala Phe Cys Leu Pro Val Phe Ala His Pro 
10 15 20 25 

gaa acg ctg gtg aaa gta aaa gat get gaa gat cag ttg ggt gca cga 328 
Glu Thr Leu Val Lys Val Lys Asp Ala Glu Asp Gin Leu Gly Ala Arg 
30 35 40 

gtg ggt tac ate gaa ctg gat etc aac age ggt aag ate ctt gag agt 376 
Val Gly Tyr lie Glu Leu Asp Leu Asn Ser Gly Lys lie Leu Glu Ser 
45 50 55 

ttt cgc ccc gaa gaa cgt ttt cca atg atg age act ttt aaa gtt ctg 424 
Phe Arg Pro Glu Glu Arg Phe Pro Met Met Ser Thr Phe Lys Val Leu 
60 65 70 

eta tgt ggc gcg gta tta tec cgt gtt gac gcc ggg caa gag caa etc 472 
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Leu Cys Gly Ala Val Leu Ser Arg Val Asp Ala Gly Gin Glu Gin Leu 
75 80 85 



ggt cgc cgc ata cac tat tct cag aat gac ttg gtt gag tac tea cca 520 
Gly Arg Arg lie His Tyr Ser Gin Asn Asp Leu Val Glu Tyr Ser Pro 
90 95 100 105 

gtc aca gaa aag cat ctt acg gat ggc atg aca gta aga gaa tta tgc 568 
Val Thr Glu Lys His Leu Thr Asp Gly Met Thr Val Arg Glu Leu Cys 
110 115 120 

agt get gec ata acc atg agt gat aac act gcg gec aac tta ctt ctg 616 
Ser Ala Ala lie Thr Met Ser Asp Asn Thr Ala Ala Asn Leu Leu Leu 
125 130 135 

aca acg ate gga gga ccg aag gag eta acc get ttt ttg cac aac atg 664 
Thr Thr lie Gly Gly Pro Lys Glu Leu Thr Ala Phe Leu His Asn Met 
140 145 150 

ggg gat cat gta act cgc ctt gat cgt tgg gaa ccg gag ctg aat gaa 712 
Gly Asp His Val Thr Arg Leu Asp Arg Trp Glu Pro Glu Leu Asn Glu 
155 160 165 

gee ata cca aac gac gag cgt gac acc acg atg cct gca gca atg gca 760 
Ala lie Pro Asn Asp Glu Arg Asp Thr Thr Met Pro Ala Ala Met Ala 
170 175 180 185 

aca acg ttg cgc aaa eta tta act ggc gaa eta ctt act eta get tec 808 
Thr Thr Leu Arg Lys Leu Leu Thr Gly Glu Leu Leu Thr Leu Ala Ser 
190 195 200 

egg caa caa tta ata gac tgg atg gag gcg gat aaa gtt gca gga cca 856 
Arg Gin Gin Leu lie Asp Trp Met Glu Ala Asp Lys Val Ala Gly Pro 
205 210 215 

ctt ctg cgc teg gec ctt ccg get ggc tgg ttt att get gat aaa tct 904 
Leu Leu Arg Ser Ala Leu Pro Ala Gly Trp Phe lie Ala Asp Lys Ser 
220 225 230 

gga gee ggt gag cgt ggg tct cgc ggt ate att gca gca ctg ggg cca 952 
Gly Ala Gly Glu Arg Gly Ser Arg Gly lie lie Ala Ala Leu Gly Pro 
235 240 245 

gat ggt aag ccc tec cgt ate gta gtt ate tac acg acg ggg agt cag 1000 
Asp Gly Lys Pro Ser Arg lie Val Val lie Tyr Thr Thr Gly Ser Gin 
250 255 260 265 

gca act atg gat gaa cga aat aga cag ate get gag ata ggt gec tea 1048 
Ala Thr Met Asp Glu Arg Asn Arg Gin lie Ala Glu lie Gly Ala Ser 
270 275 280 

ctg att aag cat tgg taa ctgtcagacc aagtttactc atatatactt 1096 
Leu lie Lys His Trp 
285 

tagattgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1156 
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aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1216 

gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1276 

acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1336 

tttccgaagg taactggctt cagcagagcg cagataccaa atactgtcct tctagtgtag 1396 

ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1456 

atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1516 

agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1576 

cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1636 

agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1696 

acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1756 

gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1816 

ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1876 

gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1936 

gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1996 

gaagcggaag agcgcctgat gcggtatttt ctccttacgc atctgtgcgg tatttcacac 2056 

cgcatatggt gcactctcag tacaatctgc tctgatgccg catagttaag ccagtataca 2116 

ctccgctatc gctacgtgac tgggtcatgg ctgcgccccg acacccgcca acacccgctg 2176 

acgcgccctg acgggcttgt ctgctcccgg catccgctta cagacaagct gtgaccgtct 2236 

ccgggagctg catgtgtcag aggttttcac cgtcatcacc gaaacgcgcg aggcagccag 2296 

cttacctccc ggtggtgggt cggtggtccc tgggcagggg tctcccgatc ccggacgagc 2356 

ccccaaatga aagacccccg ctgacgggta gtcaatcact cagaggagac cctcccaagg 2416 

aacagcgaga ccacaagtcg gatgcaactg caagagggtt tattggatac acgggtaccc 2476 

gggcgactca gtcaatcgga ggactggcgc cccgagtgag gggttgtggg ctcttttatt 2536 

gagctcgggg agcagaagcg cgcgaacaga agcgagaagc gaactgattg gttagttcaa 2596 

ataaggcaca gggtcatttc aggtccttgg ggcaccctgg aaacatctga tggttctcta 2656 

gaaactgctg agggctggac cgcatctggg gaccatctgt tcttggccct gagccggggc 2716 

aggaactgct taccacagat atcctgtttg gcccatattc agctgttcca tctgttcttg 2776 

gccctgagcc ggggcaggaa ctgcttacca cagatatcct gtttggccca tattcagctg 2836 

ttccatctgt tcctgacctt gatctgaact tctctattct cagttatgta tttttccatg 2896 
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ccttgcaaaa 


tggcgttact 


taagctagct 


tgccaaacct 


acaggtgggg 


tctttcattc 


2956 


cccccttttt 


ctggagacta 


aataaaatct 


tttattttat 


cgatgttatc 


accactttgt 


3016 


acaagaaagc 


tgggtctaga 


tatctcgagt 


gcggccgctt 


tacttgtaca 


gctcgtccat 


3076 


gccgagagtg 


atcccggcgg 


cggtcacgaa 


ctccagcagg 


accatgtgat 


cgcgcttctc 


3136 


gttggggtct 


ttgctcaggg 


cggactggta 


gctcaggtag 


tggttgtcgg 


gcagcagcac 


3196 


ggggccgtcg 


ccgatggggg 


tgttctgctg 


gtagtggtcg 


gcgagctgca 


cgctgccgtc 


3256 


ctcgatgttg 


tggcggatct 


tgaagttcac 


cttgatgccg 


ttcttctgct 


tgtcggccat 


3316 


gatatagacg 


ttgtggctgt 


tgtagttgta 


ctccagcttg 


tgccccagga 


tgttgccgtc 


3376 


ctccttgaag 


tcgatgccct 


tcagctcgat 


gcggttcacc 


agggtgtcgc 


cctcgaactt 


3436 


cacctcggcg 


cgggtcttgt 


agttgccgtc 


gtccttgaag 


aagatggtgc 


gctcctggac 


3496 


gtagccttcg 


ggcatggcgg 


acttgaagaa 


gtcgtgctgc 


ttcatgtggt 


cggggtagcg 


3556 


ggcgaagcac 


tgcaggccgt 


agccgaaggt 


ggtcacgagg 


gtgggccagg 


gcacgggcag 


3616 


cttgccggtg 


gtgcagatga 


acttcagggt 


cagcttgccg 


taggtggcat 


cgccctcgcc 


3676 


ctcgccggac 


acgctgaact 


tgtggccgtt 


tacgtcgccg 


tccagctcga 


ccaggatggg 


3736 


caccaccccg 


gtgaacagct 


cctcgccctt 


gctcaccatg 


gttaagtatt 


tccttagaga 


3796 


attggttcta 


tctccttcga 


agcctgcttt 


tttgtacaaa 


cttgtgataa 


caagcttctc 


3856 


gaggcaggac 


cgagagctga 


gcacgagggt 


ctcaggagaa 


gaacgcttct 


ggtgccgcta 


3916 


actcgacagg 


gccggcattt 


tattaattta 


tcagcaggtg 


aggtcagcag 


ctcagggtgt 


3976 


ggatttttcg 


ggaaagggga 


agttggggag 


gtacggggat 


tctagccacc 


agctgccgtc 


4036 


accagctttt 


ctggcagctg 


acgtctctgt 


ctggtttacg 


gggcggtggc 


ggcgcctagg 


4096 


tcggcatgat 


ctttcataca 


aatgattccc 


cctcaaaaaa 


ggcgggagag 


ccattcattt 


4156 


tctagcagtt 


gtaattttaa 


gggggaatca 


agtcttattt 


ttctgctagg 


atccgcctcg 


4216 


accaaagcgg 


ccatcgtgcc 


tccccactcc 


tgcagttcgg 


gggcatggat 


gcgcggatag 


4276 


ccgctgctgg 


tttcctggat 


gccgacggat 


ttgcactgcc 


ggtagaactc 


cgcgaggtcg 


4336 


tccagcctca 


ggcagcagct 


gaaccaactc 


gcgaggggat 


cgagcccggg 


gtgggcgaag 


4396 


aactccagca 


tgagatcccc 


gcgctggagg 


atcatccagc 


cggcgtcccg 


gaaaacgatt 


4456 


ccgaagccca 


acctttcata 


gaaggcggcg 


gtggaatcga 


aatctcgtga 


tggcaggttg 


4516 


ggcgtcgctt 


ggtcggtcat 


ttcgaacccc 


agagtcccgc 


tcagaagaac 


tcgtcaagaa 


4576 



rrrrrTra t~ a era a 

y y l. ay aa 


yy^-yw ^y^y 


tacaaatcaa 


oaac aac era t 


acegtaaage 


acgaggaagc 


4636 


crcrt" cacicccpi 
yy L-fc.ayfc-.fc. La 


r r pcrppcrppa 

v— Luy vvy v#uu 


aactcttcacr 


caatatcacg 


ggtagecaac 


gctatgtcct 


4696 


era t* a erperert* p 


pcrppapappp 


agccggccac 


aatcaatoaa 


tccagaaaag 


eggecatttt 


4756 


fc. * — CI fc- fc- Ci V— y CI fc, 


ci i_ tuyyuauy 


caggcatcgc 


caterercrtcac 


gacgagatcc 


tcqccatCQQ 

fc.fc-yv^>->y ^ w yy 


4816 


crpa 1 - err* err* err* 

y fc— c-i fc-y ^-*y id 


cttaaacctci 

X»* \m- W >^ X**. X-* X-* U» XJ 


gcgaacagtt 


caactaacQC 


gagcccctga 


tgctcttcgt 


4876 


era rra t" pa t" p 
L<uciy u tva, C. fc- 


ctaatcaaca 

X-* W XJ l-A, V^i» X^ W4. X-* 


agaceggett 


ccatccgagt 


acgtgctcgc 


tegatgegat 


4936 


ert" t~ t~ coot' t" cr 


cr f crcr t p era a t 

y i»yy tuyaa fc. 




cccrcratcaacj 


egtatgeage 


cgccgcattg 


4996 


C>p\ t~ CaCTCCa t~ 


aa t crcra t" a p t 


ttctcaacacr 


aaacaaaQtcj 


agatgacagg 


agatcctgcc 


5056 


rrcfacacttc 

fc» ~J y fc-fc*.fc* V- fc* 


gcccaatagc 


agccagtccc 


ttcccgcttc 


agtgacaacg 


tcgagcacag 


5116 




aacgcccgtc 


ataaccaacc 


aegatagecg 


cgctgcctcg 


tectgeagtt 


5176 


par f pacrcrcrp 


a c c crca c a era 

c*. v* fc» y y c*. v* y y 


tcoatcttcra 


caaaaagaac 


cgggcgcccc 


tgegctgaca 


5236 


yLLyyciGifc-Cifc- 


crcrpercrpa f pa 


oaerc acre c era 


ttatctatta 


tgcccagtca 


tagecgaata 


5296 




L L aay L y y v_» l 


crcra era a p p t" cr 
y y dy ci ci ^ y 


pcrt"crpaat* pp 

vy v.. y \^ fc* u. %^ ^ 


atcttattca 

fcA fc- fc» fc- fc- fcj fc* fc- V— fc* 


atcatcfcaaa 


5356 


d v— y Cl L. fc. fc. LLa 


t" PPTCrf Pf PF 


t era t p aoa fc c 


ggaat t cegg 


cocctaaaaa 


aaaaatQaQcr 

fc*i^yfc*y ^s*-*i7l7 


5416 


C1C t~ nffr! t~ a a 3 

y v_ Lyya *— ci cici 


cr crcra crcra t" per 

y y y c*.y y o fc- v-y 


aaaccrciaatc 


aaacaaacraa 


attcaaaaqcf 


aaqaqacqger 

yfc*yfc*y«-*fc-yyy 


5476 


y v-yyauyyay 


era a era crcra crcr 
y aay ay y ay y 


pereracrcrpt** t*a 


aaa tatacaa 


aaaacttaac 

cty y y fc- fc- fc»yfc*fc> 


ccaaaaaaaa 

fc.fc-.uyyyu.yyy 


5536 


yyy L-L-aaaay 


c* c pi fx ^T<^T , t" t~ c 


p pa cr err pa per 


a t" crt" acrcrcraa 
*-*■— y La yyyy a 


p p t crcr t p t crcr 

fc»fc»fc»yy fc»v» fcy y 


crtatccatQC 

^3 ^3 v* w x^ x^ 


5596 


yyyutayy ty 


a a a a era p p t" t* 


era tp t taacc 


t crcrcr t aa t era 


crcrtctccrcftt 

yy fc- fc. fc fc* y y fc fc- 


aaaggtgccg 


5656 


t" pt~ one a arc* 


a t" ppcra pa tt 


aaaaottacic 


cattctgeag 


aocacraaocit 


aacccaacgt 


5716 


prprrpfr era 

fc. L. V- I- fc* L fc- Ly Cl 


pa t" p 1" a p pcra 


ptcrcrttQtaa 


gcgatccgct 


cgacatcttt 


ccagtgacct 


5776 


aayy L-fc-aaafc. 


t" t" a a nercra ert~ 

ti-aayyyay u 


crcrt*aapacrtp 


tcrcrppptaat 


t ttcacracaa 


atacagaaac 


5836 


=s rrt" pa era c 


a era era pa a pa 
cl y ciy c*. c* c*. \_ c*. 


p a era a pcra t cr 


p tcrpaapcrpt 


gcagcagaca 


aaaccrccrccrQ 

fc*y fc*fc> y fc- y fc- y y 


5896 


c l. ucy y LLtt 


aaa^t_ycicicxy 


caaaaat"t*c , a 


era per era crcr per 
yo^yyoyy^y 


crcra apt* crt" t~ r 

yyuafc fc y fc* fc- fc- 


t* an crt" t rt per 

fc-ciy y fc, fc- fc- fc. fc. y 


5956 




yaaL*uai~ai~a 


t - cc t* era ccrcrcr 
lll LyciL.yyy 


crt" per era t" t" pp 
yu^yya.Luv_^ 


ci V— c*. fc- fc-. y ct u. v_ 


ppt"t"ppt"pacr 

fc- fc-* L- L- fc- fc- fc. fc- ci y 


6016 


y UCyyy CCcxC 


aaaaaCygCC 


ppfaaanhrr 
CCCaaay L.v^U 


ctyggdfc.yt.fc. 


Lfc.Lfc.ayyy l* l* 


ycgyccgygL. 


ov / u 


gttcagaact 


cgtcagttcc 


accacgggtc 


cgccagatac 


agagctagtt 


agctaactag 


6136 


tacagacgea 


ggegcataac 


atcaaacata 


gacactagac 


aatcggacag 


acacagataa 


6196 


gttgctggcc 


agcttacctc 


ccggtggtgg 


gtcggtggtc 


cctgggcagg 


ggtctccaaa 


6256 


tcccggacga 


gcccccaaat 


gaaagacccc 


cgtcgtgggt 


agtcaatcac 


tcagaggaga 


6316 



ccctcccaag 


gaacagcgag 


accacgattc 


ggatgcaaac 


agcaagaggc 


tttattggga 


6376 


atacgggtac 


ccgggcgacg 


cagtctatcg 


gaagactggc 


gcgccgagtg 


aggggttgtg 


6436 


ggctctttta 


ttgagctcgg 


agagcggaag 


cgcgcgaaca 


gaagcgagaa 


gcgaactgat 


6496 


tggttagttc 


aaataaggta 


cagggtcatt 


ttcaggtcct 


tggggcaccc 


tggaaacatc 


6556 


tgatgattca 


ctagaaactg 


ctgagggctg 


gaccgcatct 


ggggaccatc 


tgttcttggc 


6616 


cccgagccgg 


ggcaggaact 


gcttaccaca 


gatatcctgt 


ttggcccatc 


actcagctgt 


6676 


ctcatctgtt 


cttggccctg 


agccggggca 


ggaaccgctt 


accacagata 


tcctgtttgg 


6736 


tattcagctg 


tttctttgtt 


cctgaccttg 


atctgaactt 


ttctattctc 


agttatgtat 


6796 


ttttccatgc 


cttgcaaagt 


ggcgttactt 


aagctagctt 


gccacctacg 


ggtggggtct 


6856 


ttcaaaagca 


aagaaacgga 


agggccgcgg 


ctttggctcc 


ggtgagtgtg 


gggaataggg 


6916 


tagagtggtc 


taagaggcag 


aagcccgcga 


atttgggagt 


gtgaggggga 


cacatttgga 


6976 


ggacagtttt 


cggtgatctg 


gtatgaatta 


attc 






7010 



<210> 24 

<211> 286 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 24 

Met Ser lie Gin His Phe Arg Val Ala Leu lie Pro Phe Phe Ala Ala 
15 10 15 

Phe Cys Leu Pro Val Phe Ala His Pro Glu Thr Leu Val Lys Val Lys 
20 25 30 

Asp Ala Glu Asp Gin Leu Gly Ala Arg Val Gly Tyr lie Glu Leu Asp 
35 40 45 

Leu Asn Ser Gly Lys lie Leu Glu Ser Phe Arg Pro Glu Glu Arg Phe 
50 55 60 

Pro Met Met Ser Thr Phe Lys Val Leu Leu Cys Gly Ala Val Leu Ser 
65 70 75 80 

Arg Val Asp Ala Gly Gin Glu Gin Leu Gly Arg Arg lie His Tyr Ser 

56 



85 



90 



95 



Gin Asn Asp Leu Val Glu Tyr Ser Pro Val Thr Glu Lys His Leu Thr 
100 105 110 



Asp Gly Met Thr Val Arg Glu Leu Cys Ser Ala Ala lie Thr Met Ser 
115 120 125 



Asp Asn Thr Ala Ala Asn Leu Leu Leu Thr Thr lie Gly Gly Pro Lys 
130 135 140 



Glu Leu Thr Ala Phe Leu His Asn Met Gly Asp His Val Thr Arg Leu 
145 150 155 160 



Asp Arg Trp Glu Pro Glu Leu Asn Glu Ala lie Pro Asn Asp Glu Arg 
165 170 175 



Asp Thr Thr Met Pro Ala Ala Met Ala Thr Thr Leu Arg Lys Leu Leu 
180 185 190 



Thr Gly Glu Leu Leu Thr Leu Ala Ser Arg Gin Gin Leu lie Asp Trp 
195 200 205 



Met Glu Ala Asp Lys Val Ala Gly Pro Leu Leu Arg Ser Ala Leu Pro 
210 215 220 



Ala Gly Trp Phe lie Ala Asp Lys Ser Gly Ala Gly Glu Arg Gly Ser 
225 230 235 240 



Arg Gly lie lie Ala Ala Leu Gly Pro Asp Gly Lys Pro Ser Arg lie 
245 250 255 



Val Val lie Tyr Thr Thr Gly Ser Gin Ala Thr Met Asp Glu Arg Asn 
260 265 270 



Arg Gin He Ala Glu He Gly Ala Ser Leu He Lys His Trp 
275 280 285 



<210> 25 

<211> 7121 

<212> DNA 

<213> Artificial 

<220> 



57 



<223> Engineered sequence from virus and plasmid 



<220> 

<221> LTR 

<222> (149) . . (737) 

<223> 5* MoMuSVLTR 

<220> 

<221> CDS 

<222> (1660) (2454) 

<223> Neomycin resistance 

<220> 

<221> promoter 

<222> (2806) . . (3261) 

<223> HIV promoter 

<220> 

<2 2 1 > mi sc_f eatur e 

<222> (3281) . . (3305) 

<223> attBl 

<220> 

<221> CDS 

<222> (3347) . . (4066) 

<223> EYFP; enhanced yellow fluorescent protein 
<220> 

<221> misc_f eature 

<222> (4091) . . (4115) 

<223> attB2 

<220> 

<221> LTR 

<222> (4167) . . (4760) 

<223> 3 ' MoMuLVLTR 

<400> 25 



gaattaattc 


ataccagatc 


accgaaaact 


gtcctccaaa 


tgtgtccccc 


tcacactccc 


60 


aaattcgcgg 


gcttctgcct 


cttagaccac 


tctaccctat 


tccccacact 


caccggagcc 


120 


aaagccgcgg 


cccttccgtt 


tctttgcttt 


tgaaagaccc 


cacccgtagg 


tggcaagcta 


180 


gcttaagtaa 


cgccactttg 


caaggcatgg 


aaaaatacat 


aactgagaat 


agaaaagttc 


240 


agatcaaggt 


caggaacaaa 


gaaacagctg 


aataccaaac 


aggatatctg 


tggtaagcgg 


300 


ttcctgcccc 


ggctcagggc 


caagaacaga 


tgagacagct 


gagtgatggg 


ccaaacagga 


360 


tatctgtggt 


aagcagttcc 


tgccccggct 


cggggccaag 


aacagatggt 


ccccagatgc 


420 


ggtccagccc 


tcagcagttt 


ctagtgaatc 


atcagatgtt 


tccagggtgc 


cccaaggacc 


480 


tgaaaatgac 


cctgtacctt 


atttgaacta 


accaatcagt 


tcgcttctcg 


cttctgttcg 


540 



58 



cacac t fccccr 

>w* y v-u l. \«» 


ctctccgagc 


tcaataaaag 


agcccacaac 


ccctcactcg 


gcgcgccagt 


600 


cttccaataa 


actcrcatccrc 


ccgggtaccc 


gtattcccaa 


taaagectet 


tgctgtttgc 


660 


at~ rraaatca 


tggtctcget 


Qttccttggg 

ZJ ^ ^ ZJ ZJ Z3 


agggtctcct 


ctgagtgatt 


gactacccac 


720 


era cacrcKjat c 


tttcatttgg 


gggctcgtcc 

ZD ZJ ZJ ZJ 


gggatttgga 


gacccctgcc 


cagggaccac 


780 


cgacccacca 


ccaacraQcrta 


agctggccag 


caacttatct 


gtgtctgtcc 


gattgtctag 


840 


tatctatatt 


taatattatq 


cgcctgcgtc 


tgtactagtt 


agctaactag 


ctctgtatct 


900 


acrcoGaccccr 


taatacraact 


gacgagttct 


gaacacccgg 


ccgcaaccct 


gggagacgtc 


960 


ccagggact t 


taaaaqccqt 

Z2z3z3z3z3^^z? 


ttttgtggcc 


cgacctgagg 


aagggagtcg 


atgtggaatc 


1020 


ccraccccotc 


aaaatatatcr 


gttctggtag 


gagacgagaa 


cctaaaacag 


ttcccgcctc 


1080 


catctaaatt 


tttgettteg 


gtttggaacc 


gaagccgcgc 


gtcttgtctg 


ctgcagcgct 


1140 


err* acre at cot 

y v.- ciy ^«auv>y v» 


tctatattot 


ctctgtctga 


ctgtgtttct 


gtatttgtct 


gaaaattagg 


1200 


nr r* a era e to fc. 


taccactccc 


ttaagtttga 


ccttaggtca 


ctggaaagat 


gtcgagcgga 


1260 


t" c cr r* tr r* a c a a 


ccaatcqqta 


gatgtcaaga 


agagacgttg 


ggttaccttc 


tgctctgcag 


1320 


a a t~ ercrccaac 


etttaaegtc 


aqatqqccqc 

zj zj zy zy zj 


gagaeggcac 


ctttaaccga 


gacctcatca 


1380 


cccaaottaa 


gatcaaggtc 


ttttcacctg 


gcccgcatgg 


acacccagac 


caggtcccct 


1440 


acategtgae 


ctgggaagcc 


ttggcttttg 


acccccctcc 


ctgggtcaag 


ccctttgtac 


1500 


accctaagcc 


tccgcctcct 


cttcctccat 


ccgccccgtc 


tctccccctt 


gaacctcctc 


1560 


gttcgacccc 


gcctcgatcc 


tccctttatc 


cagccctcac 


tccttctcta 


ggcgccggaa 


1620 


ttccgatctg 


atcaagagac 


aggatgagga 


tegtttege atg att gaa 
Met He Glu 


caa gat 
Gin Asp 


1674 



1 5 

gga ttg cac gca ggt tct ccg gec get tgg gtg gag agg eta ttc ggc 1722 
Gly Leu His Ala Gly Ser Pro Ala Ala Trp Val Glu Arg Leu Phe Gly 
10 15 20 

tat gac tgg gca caa cag aca ate ggc tgc tct gat gec gee gtg ttc 1770 
Tyr Asp Trp Ala Gin Gin Thr He Gly Cys Ser Asp Ala Ala Val Phe 
25 30 35 

egg ctg tea gcg cag ggg cgc ccg gtt ctt ttt gtc aag ace gac ctg 1818 
Arg Leu Ser Ala Gin Gly Arg Pro Val Leu Phe Val Lys Thr Asp Leu 
40 45 50 

tec ggt gec ctg aat gaa ctg cag gac gag gca gcg egg eta teg tgg 1866 
Ser Gly Ala Leu Asn Glu Leu Gin Asp Glu Ala Ala Arg Leu Ser Trp 
55 60 65 

ctg gec acg acg ggc gtt cct tgc gca get gtg etc gac gtt gtc act 1914 



59 



Leu Ala Thr Thr Gly Val Pro Cys Ala Ala Val Leu Asp Val Val Thr 
70 75 80 85 



gaa gcg gga agg gac tgg ctg eta ttg ggc gaa gtg ccg ggg cag gat 1962 
Glu Ala Gly Arg Asp Trp Leu Leu Leu Gly Glu Val Pro Gly Gin Asp 
90 95 100 



etc ctg tea tct cac ctt get cct gee gag aaa gta tec ate atg get 
Leu Leu Ser Ser His Leu Ala Pro Ala Glu Lys Val Ser He Met Ala 
105 110 115 

gat gca atg egg egg ctg cat acg ctt gat ccg get acc tgc cca ttc 
Asp Ala Met Arg Arg Leu His Thr Leu Asp Pro Ala Thr Cys Pro Phe 
120 125 130 

gac cac caa gcg aaa cat cgc ate gag cga gca cgt act egg atg gaa 
Asp His Gin Ala Lys His Arg He Glu Arg Ala Arg Thr Arg Met Glu 
135 140 145 



gcg cca gee gaa ctg ttc gee agg etc aag gcg cgc atg ccc gac ggc 
Ala Pro Ala Glu Leu Phe Ala Arg Leu Lys Ala Arg Met Pro Asp Gly 
170 175 180 



2010 



2058 



2106 



gec ggt ctt gtc gat cag gat gat ctg gac gaa gag cat cag ggg etc 2154 
Ala Gly Leu Val Asp Gin Asp Asp Leu Asp Glu Glu His Gin Gly Leu 
150 "* 155 160 165 



2202 



gag gat etc gtc gtg acc cat ggc gat gee tgc ttg ccg aat ate atg 2250 
Glu Asp Leu Val Val Thr His Gly Asp Ala Cys Leu Pro Asn He Met 
185 190 195 

gtg gaa aat ggc cgc ttt tct gga ttc ate gac tgt ggc egg ctg ggt 2298 
Val Glu Asn Gly Arg Phe Ser Gly Phe He Asp Cys Gly Arg Leu Gly 
200 205 210 

gtg gcg gac cgc tat cag gac ata gcg ttg get acc cgt gat att get 2346 
Val Ala Asp Arg Tyr Gin Asp He Ala Leu Ala Thr Arg Asp He Ala 
215 220 225 

gaa gag ctt ggc ggc gaa tgg get gac cgc ttc etc gtg ctt tac ggt 2394 
Glu Glu Leu Gly Gly Glu Trp Ala Asp Arg Phe Leu Val Leu Tyr Gly 
230 235 240 245 

ate gee get ccc gat teg cag cgc ate gee ttc tat cgc ctt ctt gac 2442 
He Ala Ala Pro Asp Ser Gin Arg He Ala Phe Tyr Arg Leu Leu Asp 
250 255 260 

gag ttc ttc tga gegggactet ggggttcgaa atgaccgacc aagcgacgcc 2494 
Glu Phe Phe 

caacctgcca tcacgagatt tcgattccac cgccgccttc tatgaaaggt tgggcttegg 2554 

aatcgttttc cgggacgccg gctggatgat cctccagcgc ggggatctca tgctggagtt 2614 

cttcgcccac cccgggctcg atcccctcgc gagttggttc agctgctgcc tgaggctgga 2674 



60 



cgacctcgcg gagttctacc ggcagtgcaa atccgtcggc atccaggaaa ccagcagcgg 2734 

ctatccgcgc atccatgccc ccgaactgca ggagtgggga ggcacgatgg ccgctttggt 2794 

cgaggcggat cctggaaggg ctaatttggt cccaaagaag acaagagatc cttgatctgt 2854 

ggatctacca cacacaaggc tacttccctg attggcagaa ttacacacca gggccaggga 2914 

tcagatatcc actgaccttt ggatggtgct tcaagctagt accagttgag ccagagaagg 2974 

tagaagaggc caatgaagga gagaacaaca gcttgttaca ccctatgagc ctgcatggga 3034 

tggaggacgc ggagaaagaa gtgttagtgt ggaggtttga cagcaaacta gcatttcatc 3094 

acatggcccg agagctgcat ccggagtact acaaagactg ctgacatcga gctttctaca 3154 

agggactttc cgctggggac tttccaggga ggcgtggcct gggcgggact ggggagtggc 3214 

gtccctcaga tgctgcatat aagcagctgc tttttgcctg tactgggcct cgagaagctt 3274 

gttatcacaa gtttgtacaa aaaagcaggc ttcgaaggag atagaaccaa ttctctaagg 3334 

aaatacttaa cc atg gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg 3385 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val 
265 270 275 

ccc ate ctg gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age 3433 
Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser 
280 285 290 

gtg tec ggc gag ggc gag ggc gat gee acc tac ggc aag ctg acc ctg 3481 
Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu 
295 300 305 

aag ttc ate tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc etc 3529 
Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu 
310 315 320 325 

gtg acc acc ttc ggc tac ggc ctg cag tgc ttc gee cgc tac ccc gac 3577 
Val Thr Thr Phe Gly Tyr Gly Leu Gin Cys Phe Ala Arg Tyr Pro Asp 
330 335 340 

cac atg aag cag cac gac ttc ttc aag tec gee atg ccc gaa ggc tac 3625 
His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr 
345 350 355 

gtc cag gag cgc acc ate ttc ttc aag gac gac ggc aac tac aag acc 3673 
Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr 
360 365 370 

cgc gee gag gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag 3721 
Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu 
375 380 385 

ctg aag ggc ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag 37 69 

Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys 
390 395 400 405 



61 



ctg gag tac aac tac aac age cac aac gtc tat ate atg gee gac aag 3817 
Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys 
410 415 420 

cag aag aac ggc ate aag gtg aac ttc aag ate cgc cac aac ate gag 3865 
Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu 
425 430 435 

gac ggc age gtg cag etc gec gac cac tac cag cag aac ace ccc ate 
Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He 
440 445 450 

ggc gac ggc ccc gtg ctg ctg ccc gac aac cac tac ctg age tac cag 
Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin 
455 460 465 

tec gee ctg age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg 
Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu 
470 475 480 485 

ctg gag ttc gtg ace gee gee ggg ate act etc ggc atg gac gag ctg 
Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu 
490 495 500 

tac aag taa agcggccgca ctcgagatat ctagacccag ctttcttgta 4106 
Tyr Lys 



3913 



3961 



4009 



4057 



caaagtggtg 


ataacatcga . 


taaaataaaa 


gattttattt 


agtctccaga 


aaaagggggg 


4166 


aatgaaagac 


cccacctgta 


ggtttggcaa 


gctagcttaa 


gtaacgecat 


tttgeaagge 


4226 


atggaaaaat 


acataactga 


gaatagagaa 


gttcagatca 


aggtcaggaa 


cagatggaac 


4286 


agctgaatat 


gggecaaaca 


ggatatctgt 


ggtaagcagt 


tcctgccccg 


gctcagggcc 


4346 


aagaacagat 


ggaacagctg 


aatatgggcc 


aaacaggata 


tctgtggtaa 


gcagttcctg 


4406 


ccccggctca 


gggccaagaa 


cagatggtcc 


ecagatgegg 


tccagccctc 


agcagtttct 


4466 


agagaaccat 


cagatgtttc 


cagggtgccc 


caaggacctg 


aaatgaccct 


gtgecttatt 


4526 


tgaactaacc 


aatcagttcg 


cttctcgctt 


ctgttcgcgc 


gcttctgctc 


cccgagctca 


4586 


ataaaagagc 


ccacaacccc 


teactegggg 


cgccagtcct 


ccgattgact 


gagtcgcccg 


4646 


ggtacccgtg 


tatccaataa 


accctcttgc 


agttgcatcc 


gacttgtggt 


ctcgctgttc 


4706 


cttgggaggg 


tctcctctga 


gtgattgact 


acccgtcagc 


gggggtcttt 


catttggggg 


4766 


ctcgtccggg 


ategggagae 


ccctgcccag 


ggaccaccga 


cccaccaccg 


ggaggtaagc 


4826 


tggctgcctc 


gcgcgtttcg 


gtgatgaegg 


tgaaaacctc 


tgacacatgc 


agctcccgga 


4886 


gaeggtcaca 


gcttgtctgt 


aagcggatgc 


egggagcaga 


caagcccgtc 


agggegegtc 


4946 



62 



aacaQQtatt 

O -J 3) w -3 


ggcgggtgtc 


ggggcgcagc 


catgacccag 


tcacgtagcg 


atagcggagt 


5006 


atatactaqc 


ttaactatgc 


ggcatcagag 


cagattgtac 


tgagagtgca 


ccatatgcgg 


5066 


tgtgaaatac 


cgcacagatg 


cgtaaggaga 


aaataccgca 


tcaggcgctc 


ttccgcttcc 


5126 


tcgctcactg 


actcgctgcg 


ctcggtcgtt 


cggctgcggc 


gagcggtatc 


agctcactca 


5186 


aaggcggtaa 


tacggttatc 


cacagaatca 


ggggataacg 


caggaaagaa 


catgtgagca 


5246 


aaaggccagc 


aaaaggccag 


gaaccgtaaa 


aaggccgcgt 


tgctggcgtt 


tttccatagg 


5306 


ctccgccccc 


ctgacgagca 


tcacaaaaat 


cgacgctcaa 


gtcagaggtg 


gcgaaacccg 


5366 


acaggactat 


aaagatacca 


ggcgtttccc 


cctggaagct 


ccctcgtgcg 


ctctcctgtt 


5426 


ccgaccctgc 


cgcttaccgg 


atacctgtcc 


gcctttctcc 


cttcgggaag 


cgtggcgctt 


5486 


tctcatagct 


cacgctgtag 


gtatctcagt 


tcggtgtagg 


tcgttcgctc 


caagctgggc 


5546 


tatcrtacaca 


aaccccccgt 


tcagcccgac 


cgctgcgcct 


tatccggtaa 


ctatcgtctt 


5606 


gagtccaacc 


cggtaagaca 


cgacttatcg 


ccactggcag 


cagccactgg 


taacaggatt 


5666 


aacacjacrccia 


qqtatgtagg 


cggtgctaca 


gagttcttga 


agtggtggcc 


taactacggc 


5726 


tacactagaa 


ggacagtatt 


tggtatctgc 


gctctgctga 


agccagttac 


cttcggaaaa 


5786 


aaacrt t aat a 


gctcttgatc 


cggcaaacaa 


accaccgctg 


gtagcggtgg 


tttttttgtt 


5846 


tgcaagcagc 


agattacgcg 


cagaaaaaaa 


ggatctcaag 


aagatccttt 


gatcttttct 


5906 


acaaqatctg 


acgctcagtg 


gaacgaaaac 


tcacgttaag 


ggattttggt 


catgagatta 


5966 


t caaaaagga 


tcttcaccta 


gatcctttta 


aattaaaaat 


gaagttttaa 


atcaatctaa 


6026 


agtatatatg 


agtaaacttg 


gtctgacagt 


taccaatgct 


taatcagtga 


ggcacctatc 


6086 


tcagcgatct 


gtctatttcg 


ttcatccata 


gttgcctgac 


tccccgtcgt 


gtagataact 


6146 


acaatacacrcr 


agggcttacc 


atctggcccc 


agtgctgcaa 


tgataccgcg 


agacccacgc 


6206 


tcaccggctc 


cagatttatc 


agcaataaac 


cagccagccg 


gaagggccga 


gcgcagaagt 


6266 


aatcctocaa 


ctttatccgc 


ctccatccag 


tctattaatt 


gttgccggga 


agctagagta 


6326 


aataattcac 


cagttaatag 


tttgcgcaac 


gttgttgcca 


ttgctgcagg 


catcgtggtg 


6386 




rat* t t~crcrt* a t 


tract tcattc 


agctccggtt 


cccaacgatc 


aaggcgagtt 


6446 


acatgatccc 


ccatgttgtg 


caaaaaagcg 


gttagctcct 


tcggtcctcc 


gatcgttgtc 


c tr c\ a 


agaagtaagt 


tggccgcagt 


gttatcactc 


atggttatgg 


cagcactgca 


taattctctt 


6566 


actgtcatgc 


catccgtaag 


atgcttttct 


gtgactggtg 


agtactcaac 


caagtcattc 


6626 


tgagaatagt 


gtatgcggcg 


accgagttgc 


tcttgcccgg 


cgtcaacacg 


ggataatacc 


6686 



gcgccacata 


gcagaacttt 


aaaagtgctc 


atcattggaa aacgttcttc ggggcgaaaa 


6746 


ctctcaagga 


tcttaccgct 


gttgagatcc 


agttcgatgt 


aacccactcg 


tgcacccaac 


6806 


tgatcttcag 


catcttttac 


tttcaccagc 


gtttctgggt 


gagcaaaaac 


aggaaggcaa 


6866 


aatgccgcaa 


aaaagggaat 


aagggcgaca 


cggaaatgtt 


gaatactcat 


actcttcctt 


6926 


tttcaatatt 


attgaagcat 


ttatcagggt 


tattgtctca 


tgagcggata 


catatttgaa 


6986 


tgtatttaga 


aaaataaaca 


aataggggtt 


ccgcgcacat 


ttccccgaaa 


agtgccacct 


7046 


gacgtctaag 


aaaccattat 


tatcatgaca 


ttaacctata 


aaaataggcg 


tatcacgagg 


7106 


ccctttcgtc 


ttcaa 










7121 



<210> 26 

<211> 264 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 26 

Met lie Glu Gin Asp Gly Leu His Ala Gly Ser Pro Ala Ala Trp Val 
15 10 15 



Glu Arg Leu Phe Gly Tyr Asp Trp Ala Gin Gin Thr He Gly Cys Ser 
20 25 30 



Asp Ala Ala Val Phe Arg Leu Ser Ala Gin Gly Arg Pro Val Leu Phe 
35 40 45 



Val Lys Thr Asp Leu Ser Gly Ala Leu Asn Glu Leu Gin Asp Glu Ala 
50 55 60 



Ala Arg Leu Ser Trp Leu Ala Thr Thr Gly Val Pro Cys Ala Ala Val 
65 70 75 80 



Leu Asp Val Val Thr Glu Ala Gly Arg Asp Trp Leu Leu Leu Gly Glu 
85 90 95 



Val Pro Gly Gin Asp Leu Leu Ser Ser His Leu Ala Pro Ala Glu Lys 
100 105 110 



Val Ser He Met Ala Asp Ala Met Arg Arg Leu His Thr Leu Asp Pro 



64 



115 



120 



125 



Ala Thr Cys Pro Phe Asp His Gin Ala Lys His Arg He Glu Arg Ala 
130 135 140 



Arg Thr Arg Met Glu Ala Gly Leu Val Asp Gin Asp Asp Leu Asp Glu 
145 " 150 155 160 



Glu His Gin Gly Leu Ala Pro Ala Glu Leu Phe Ala Arg Leu Lys Ala 
165 170 175 



Arg Met Pro Asp Gly Glu Asp Leu Val Val Thr His Gly Asp Ala Cys 
180 185 190 



Leu Pro Asn He Met Val Glu Asn Gly Arg Phe Ser Gly Phe He Asp 
195 200 205 



Cys Gly Arg Leu Gly Val Ala Asp Arg Tyr Gin Asp He Ala Leu Ala 
210 215 220 



Thr Arg Asp He Ala Glu Glu Leu Gly Gly Glu Trp Ala Asp Arg Phe 
225 230 235 240 



Leu Val Leu Tyr Gly He Ala Ala Pro Asp Ser Gin Arg He Ala Phe 
245 250 255 



Tyr Arg Leu Leu Asp Glu Phe Phe 
260 



<210> 


27 


<211> 


239 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic i 


<400> 


27 



Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 



Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 
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Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 



Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 



Phe Gly Tyr Gly Leu Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 
65 70 75 80 



Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 * 90 95 



Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 HO 



Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 



He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 



Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 



Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 



Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 



Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu 
195 200 205 



Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 



Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225 230 235 



<210> 28 

<211> 7121 

<212> DNA 

<213> Artificial 
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<220> 

<223> Engineered sequence form virus and plasmid 



<220> 

<221> CDS 

<222> (206) . . (1066) 

<223> Ampicillin resistance 



<400> 28 

ttgaagacga aagggcctcg tgatacgcct atttttatag gttaatgtca tgataataat 60 

ggtttcttag acgtcaggtg gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt 120 

atttttctaa atacattcaa atatgtatcc gctcatgaga caataaccct gataaatgct 180 

tcaataatat tgaaaaagga agagt atg agt att caa cat ttc cgt gtc gcc 232 

Met Ser lie Gin His Phe Arg Val Ala 
1 5 

ctt att ccc ttt ttt gcg gca ttt tgc ctt cct gtt ttt get cac cca 280 
Leu lie Pro Phe Phe Ala Ala Phe Cys Leu Pro Val Phe Ala His Pro 
10 15 20 25 

gaa acg ctg gtg aaa gta aaa gat get gaa gat cag ttg ggt gca cga 328 
Glu Thr Leu Val Lys Val Lys Asp Ala Glu Asp Gin Leu Gly Ala Arg 
30 35 40 

gtg ggt tac ate gaa ctg gat etc aac age ggt aag ate ctt gag agt 376 
Val Gly Tyr He Glu Leu Asp Leu Asn Ser Gly Lys He Leu Glu Ser 
45 50 55 

ttt cgc ccc gaa gaa cgt ttt cca atg atg age act ttt aaa gtt ctg 424 
Phe Arg Pro Glu Glu Arg Phe Pro Met Met Ser Thr Phe Lys Val Leu 
60 65 70 

eta tgt gge gcg gta tta tec cgt gtt gac gcc ggg caa gag caa etc 472 
Leu Cys Gly Ala Val Leu Ser Arg Val Asp Ala Gly Gin Glu Gin Leu 
75 80 85 

ggt cgc cgc ata cac tat tct cag aat gac ttg gtt gag tac tea cca 
Gly Arg Arg He His Tyr Ser Gin Asn Asp Leu Val Glu Tyr Ser Pro 
90 95 100 105 

gtc aca gaa aag cat ctt acg gat gge atg aca gta aga gaa tta tgc 
Val Thr Glu Lys His Leu Thr Asp Gly Met Thr Val Arg Glu Leu Cys 
110 115 120 

agt get gcc ata ace atg agt gat aac act gcg gcc aac tta ctt ctg 
Ser Ala Ala He Thr Met Ser Asp Asn Thr Ala Ala Asn Leu Leu Leu 
125 130 135 

aca acg ate gga gga ccg aag gag eta ace get ttt ttg cac aac atg 664 
Thr Thr He Gly Gly Pro Lys Glu Leu Thr Ala Phe Leu His Asn Met 
140 145 150 

ggg gat cat gta act cgc ctt gat cgt tgg gaa ccg gag ctg aat gaa 712 



520 



568 



616 
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Gly Asp His Val Thr Arg Leu Asp Arg Trp Glu Pro Glu Leu Asn Glu 
155 160 165 

gcc ata cca aac gac gag cgt gac acc acg atg cct gca gca atg gca 
Ala He Pro Asn Asp Glu Arg Asp Thr Thr Met Pro Ala Ala Met Ala 
170 175 180 185 

aca acg ttg cgc aaa eta tta act ggc gaa eta ctt act eta get tec 
Thr Thr Leu Arg Lys Leu Leu Thr Gly Glu Leu Leu Thr Leu Ala Ser 
190 195 200 

egg caa caa tta ata gac tgg atg gag gcg gat aaa gtt gca gga cca 
Arg Gin Gin Leu He Asp Trp Met Glu Ala Asp Lys Val Ala Gly Pro 
205 210 215 

ctt ctg cgc teg gcc ctt ccg get ggc tgg ttt att get gat aaa tct 
Leu Leu Arg Ser Ala Leu Pro Ala Gly Trp Phe He Ala Asp Lys Ser 
220 225 230 

gga gcc ggt gag cgt ggg tct cgc ggt ate att gca gca ctg ggg cca 
Gly Ala Gly Glu Arg Gly Ser Arg Gly He He Ala Ala Leu Gly Pro 
235 240 245 

gat ggt aag ccc tec cgt ate gta gtt ate tac acg acg ggg agt cag 
Asp Gly Lys Pro Ser Arg He Val Val He Tyr Thr Thr Gly Ser Gin 
250 * ^ 255 260 265 

gca act atg gat gaa cga aat aga cag ate get gag ata ggt gcc tea 
Ala Thr Met Asp Glu Arg Asn Arg Gin He Ala Glu He Gly Ala Ser 
270 275 280 

ctg att aag cat tgg taa ctgtcagacc aagtttactc atatatactt 
Leu He Lys His Trp 
285 

tagattgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagegtc agaccccgta 
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 
tttccgaagg taactggctt cagcagagcg cagataccaa atactgtcct tctagtgtag 
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttacegg gttggactca 
agacgatagt taceggataa ggegcagegg tegggctgaa eggggggtte gtgcacacag 
cccagcttgg agegaacgae ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc eggtaagegg cagggtegga 
acaggagagc gcacgaggga gcttccaggg ggaaacgect ggtatcttta tagtcctgtc 
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gggtttcgcc 


acctctgact 


tgagcgtcga tttttgtgat 


gctcgtcagg ggggcggagc 


1816 


ctatggaaaa 


acgccagcaa 


cgcggccttt ttacggttcc 


tggccttttg 


ctggcctttt 


1876 


gctcacatgt 


tctttcctgc gttatcccct gattctgtgg ataaccgtat 


taccgccttt 


1936 


gagtgagctg ataccgctcg ccgcagccga acgaccgagc 


gcagcgagtc agtgagcgag 


1996 


gaagcggaag 


agcgcctgat 


gcggtatttt ctccttacgc 


atctgtgcgg 


tatttcacac 


2056 


cgcatatggt 


gcactctcag 


tacaatctgc tctgatgccg 


catagttaag 


ccagtataca 


2116 


ctccgctatc 


gctacgtgac 


tgggtcatgg ctgcgccccg acacccgcca 


acacccgctg 


2176 


acgcgccctg acgggcttgt ctgctcccgg catccgctta cagacaagct 


gtgaccgtct 


2236 


ccgggagctg 


catgtgtcag 


aggttttcac cgtcatcacc 


gaaacgcgcg 


aggcagccag 


2296 


cttacctccc ggtggtgggt 


cggtggtccc tgggcagggg 


tctcccgatc 


ccggacgagc 


2356 


ccccaaatga 


aagacccccg 


ctgacgggta gtcaatcact 


cagaggagac 


cctcccaagg 


2416 


aacagcgaga 


ccacaagtcg 


gatgcaactg caagagggtt 


tattggatac 


acgggtaccc 


2476 


gggcgactca 


gtcaatcgga 


ggactggcgc cccgagtgag 


gggttgtggg 


ctcttttatt 


2536 


gagctcgggg 


agcagaagcg 


cgcgaacaga agcgagaagc 


gaactgattg 


gttagttcaa 


2596 


ataaggcaca gggtcatttc aggtccttgg ggcaccctgg aaacatctga tggttctcta 


2656 


gaaactgctg 


agggctggac 


cgcatctggg gaccatctgt 


tcttggccct 


gagccggggc 


2716 


aggaactgct 


taccacagat 


atcctgtttg gcccatattc 


agctgttcca 


tctgttcttg 


2776 


gccctgagcc 


ggggcaggaa 


ctgcttacca cagatatcct 


gtttggccca 


tattcagctg 


2836 


ttccatctgt 


tcctgacctt 


gatctgaact tctctattct 


cagttatgta 


tttttccatg 


2896 


ccttgcaaaa 


tggcgttact 


taagctagct tgccaaacct acaggtgggg 


tctttcattc 


2956 


cccccttttt 


ctggagacta 


aataaaatct tttattttat 


cgatgttatc 


accactttgt 


3016 


acaagaaagc 


tgggtctaga 


tatctcgagt gcggccgctt 


tacttgtaca 


gctcgtccat 


3076 


gccgagagtg atcccggcgg cggtcacgaa ctccagcagg 


accatgtgat 


cgcgcttctc 


3136 


gttggggtct 


ttgctcaggg 


cggactggta gctcaggtag 


tggttgtcgg 


gcagcagcac 


3196 


ggggccgtcg 


ccgatggggg 


tgttctgctg gtagtggtcg 


gcgagctgca 


cgctgccgtc 


3256 


ctcgatgttg 


tggcggatct 


tgaagttcac cttgatgccg 


ttcttctgct 


tgtcggccat 


3316 


gatatagacg 


ttgtggctgt 


tgtagttgta ctccagcttg 


tgccccagga 


tgttgccgtc 


3376 


ctccttgaag 


tcgatgccct 


tcagctcgat gcggttcacc 


agggtgtcgc 


cctcgaactt 


3436 


cacctcggcg 


cgggtcttgt 


agttgccgtc gtccttgaag 


aagatggtgc 


gctcctggac 


3496 
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gtagccttcg ggcatggcgg acttgaagaa gtcgtgctgc 


ttcatgtggt 


cggggtagcg 


3556 


ggcgaagcac 


tgcaggccgt 


agccgaaggt 


ggtcacgagg 


gtgggccagg 


gcacgggcag 


3616 


cttgccggtg gtgcagatga acttcagggt cagcttgccg 


taggtggcat 


cgccctcgcc 


3676 


ctcgccggac 


acgctgaact 


tgtggccgtt 


tacgtcgccg 


tccagctcga 


ccaggatggg 


3736 


caccaccccg 


gtgaacagct cctcgccctt gctcaccatg gttaagtatt 


tccttagaga 


3796 


attggttcta 


tctccttcga agcctgcttt 


tttgtacaaa 


cttgtgataa 


caagcttctc 


3856 


gaggcccagt 


acaggcaaaa 


agcagctgct 


tatatgcagc 


atctgaggga 


cgccactccc 


3916 


cagtcccgcc 


caggccacgc 


ctccctggaa 


agtccccagc 


ggaaagtccc 


ttgtagaaag 


3976 


ctcgatgtca 


gcagtctttg 


tagtactccg gatgcagctc 


tcgggccatg 


tgatgaaatg 


4036 


ctagtttgct 


gtcaaacctc 


cacactaaca 


cttctttctc 


cgcgtcctcc 


atcccatgca 


4096 


ggctcatagg 


gtgtaacaag 


ctgttgttct 


ctccttcatt 


ggcctcttct 


accttctctg 


4156 


gctcaactgg 


tactagcttg 


aagcaccatc 


caaaggtcag 


tggatatctg 


atccctggcc 


4216 


ctggtgtgta attctgccaa 


tcagggaagt 


agccttgtgt 


gtggtagatc 


cacagatcaa 


4276 


ggatctcttg 


tcttctttgg 


gaccaaatta 


gcccttccag 


gatccgcctc 


gaccaaagcg 


4336 


gccatcgtgc 


ctccccactc 


ctgcagttcg 


ggggcatgga 


tgcgcggata 


gccgctgctg 


4396 


gtttcctgga 


tgccgacgga 


tttgcactgc 


cggtagaact 


ccgcgaggtc 


gtccagcctc 


4456 


aggcagcagc 


tgaaccaact 


cgcgagggga 


tcgagcccgg 


ggtgggcgaa 


gaactccagc 


4516 


atgagatccc 


cgcgctggag 


gatcatccag 


ccggcgtccc 


ggaaaacgat 


tccgaagccc 


4576 


aacctttcat 


agaaggcggc 


ggtggaatcg 


aaatctcgtg 


atggcaggtt 


gggcgtcgct 


4636 


tggtcggtca 


tttcgaaccc 


cagagtcccg 


ctcagaagaa 


ctcgtcaaga 


aggcgataga 


4696 


aggcgatgcg 


ctgcgaatcg 


ggagcggcga 


taccgtaaag 


cacgaggaag 


cggtcagccc 


4756 


attcgccgcc 


aagctcttca 


gcaatatcac 


gggtagccaa 


cgctatgtcc 


tgatagcggt 


4816 


ccgccacacc 


cagccggcca 


cagtcgatga atccagaaaa gcggccattt tccaccatga 


4876 


tattcggcaa gcaggcatcg 


ccatgggtca 


cgacgagatc 


ctcgccgtcg ggcatgcgcg 


4936 


ccttgagcct 


ggcgaacagt 


tcggctggcg 


cgagcccctg 


atgctcttcg 


tccagatcat 


4996 


cctgatcgac 


aagaccggct 


tccatccgag 


tacgtgctcg 


ctcgatgcga 


tgtttcgctt 


5056 


ggtggtcgaa 


tgggcaggta 


gccggatcaa 


gcgtatgcag 


ccgccgcatt 


gcatcagcca 


5116 


tgatggatac 


tttctcggca 


ggagcaaggt 


gagatgacag 


gagatcctgc 


cccggcactt 


5176 
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cgcccaatag 


cagccagtcc 


cttcccgctt 


cagtgacaac 


gtcgagcaca 


gctgcgcaag 


5236 


gaacgcccgt 


cgtggccagc 


cacgatagcc 


gcgctgcctc 


gtcctgcagt 


tcattcaggg 


5296 


caccggacag 


gtcggtcttg acaaaaagaa ccgggcgccc 


ctgcgctgac 


agccggaaca 


5356 


cggcggcatc agagcagccg 


attgtctgtt gtgcccagtc atagccgaat agcctctcca 


5416 


cccaagcggc 


cggagaacct 


gcgtgcaatc 


catcttgttc 


aatcatgcga 


aacgatcctc 


5476 


atcctgtctc 


ttgatcagat 


cggaattccg 


gcgcctagag 


aaggagtgag 


ggctggataa 


5536 


agggaggatc 


gaggcggggt 


cgaacgagga 


ggttcaaggg ggagagacgg ggcggatgga 


5596 


ggaagaggag 


gcggaggctt 


agggtgtaca 


aagggcttga 


cccagggagg 


ggggtcaaaa 


5656 


gccaaggctt 


cccaggtcac 


gatgtagggg 


acctggtctg 


ggtgtccatg 


cgggccaggt 


5716 


gaaaagacct 


tgatcttaac 


ctgggtgatg 


aggtctcggt 


taaaggtgcc 


gtctcgcggc 


5776 


catccgacgt 


taaaggttgg 


ccattctgca 


gagcagaagg 


taacccaacg 


tctcttcttg 


5836 


acatctaccg 


actggttgtg 


agcgatccgc 


tcgacatctt 


tccagtgacc 


taaggtcaaa 


5896 


cttaagggag 


tggtaacagt 


ctggccctaa 


ttttcagaca 


aatacagaaa 


cacagtcaga 


5956 


cagagacaac 


acagaacgat 


gctgcagcgc 


tgcagcagac aagacgcgcg gcttcggttc 


6016 


caaaccgaaa 


gcaaaaattc 


agacggaggc 


gggaactgtt 


ttaggttctc 


gtctcctacc 


6076 


agaaccacat 


atcctgacgg 


ggtcggattc 


cacatcgact 


cccttcctca 


ggtcgggcca 


6136 


caaaaacggc 


ccccaaagtc 


cctgggacgt 


ctcccagggt 


tgcggccggg 


tgttcagaac 


6196 


tcgtcagttc 


caccacgggt 


ccgccagata 


cagagctagt 


tagctaacta gtacagacgc 


6256 


aggcgcataa 


catcaaacat 


agacactaga 


caatcggaca 


gacacagata 


agttgctggc 


6316 


cagcttacct 


cccggtggtg 


ggtcggtggt 


ccctgggcag 


gggtctccaa 


atcccggacg 


6376 


agcccccaaa 


tgaaagaccc 


ccgtcgtggg 


tagtcaatca 


ctcagaggag 


accctcccaa 


6436 


ggaacagcga 


gaccacgatt 


cggatgcaaa 


cagcaagagg 


ctttattggg 


aatacgggta 


6496 


cccgggcgac 


gcagtctatc 


ggaagactgg 


cgcgccgagt 


gaggggttgt 


gggctctttt 


6556 


attgagctcg 


gagagcggaa 


gcgcgcgaac 


agaagcgaga 


agcgaactga 


ttggttagtt 


6616 


caaataaggt 


acagggtcat 


tttcaggtcc 


ttggggcacc 


ctggaaacat 


ctgatgattc 


6676 


actagaaact gctgagggct 


ggaccgcatc 


tggggaccat 


ctgttcttgg 


ccccgagccg 


6736 


gggcaggaac 


tgcttaccac 


agatatcctg 


tttggcccat 


cactcagctg 


tctcatctgt 


6796 


tcttggccct 


gagccggggc 


aggaaccgct 


taccacagat 


atcctgtttg 


gtattcagct 


6856 


gtttctttgt 


tcctgacctt 


gatctgaact 


tttctattct 


cagttatgta 


tttttccatg 


6916 
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ccttgcaaag tggcgttact taagctagct tgccacctac gggtggggtc tttcaaaagc 6976 
aaagaaacgg aagggccgcg gctttggctc cggtgagtgt ggggaatagg gtagagtggt 7036 
ctaagaggca gaagcccgcg aatttgggag tgtgaggggg acacatttgg aggacagttt 7096 

71 91 

tcggtgatct ggtatgaatt aattc 



<210> 29 

<211> 286 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 29 

Met Ser He Gin His Phe Arg Val Ala Leu He Pro Phe Phe Ala Ala 
15 10 15 



Phe Cys Leu Pro Val Phe Ala His Pro Glu Thr Leu Val Lys Val Lys 
20 25 30 



Asp Ala Glu Asp Gin Leu Gly Ala Arg Val Gly Tyr He Glu Leu Asp 

35 40 45 

Leu Asn Ser Gly Lys He Leu Glu Ser Phe Arg Pro Glu Glu Arg Phe 
50 55 60 



Pro Met Met Ser Thr Phe Lys Val Leu Leu Cys Gly Ala Val Leu Ser 
65 70 75 80 

Arg Val Asp Ala Gly Gin Glu Gin Leu Gly Arg Arg He His Tyr Ser 
85 90 95 



Gin Asn Asp Leu Val Glu Tyr Ser Pro Val Thr Glu Lys His Leu Thr 
100 105 110 



Asp Gly Met Thr Val Arg Glu Leu Cys Ser Ala Ala He Thr Met Ser 
115 120 125 



Asp Asn Thr Ala Ala Asn Leu Leu Leu Thr Thr He Gly Gly Pro Lys 
130 135 140 



Glu Leu Thr Ala Phe Leu His Asn Met Gly Asp His Val Thr Arg Leu 

72 



145 




150K^ SI 155 160 

Asp Arg Trp Glu Pro Glu Leu Asn Glu Ala He Pro Asn Asp Glu Arg 
165 170 175 



Asp Thr Thr Met Pro Ala Ala Met Ala Thr Thr Leu Arg Lys Leu Leu 
180 185 190 



Thr Gly Glu Leu Leu Thr Leu Ala Ser Arg Gin Gin Leu He Asp Trp 
195 200 205 

Met Glu Ala Asp Lys Val Ala Gly Pro Leu Leu Arg Ser Ala Leu Pro 
210 215 220 



Ala Gly Trp Phe He Ala Asp Lys Ser Gly Ala Gly Glu Arg Gly Ser 
225 ' 230 235 240 



Arg Gly He He Ala Ala Leu Gly Pro Asp Gly Lys Pro Ser Arg He 
245 250 255 



Val Val He Tyr Thr Thr Gly Ser Gin Ala Thr Met Asp Glu Arg Asn 
260 265 270 



Arg Gin He Ala Glu He Gly Ala Ser Leu He Lys His Trp 
275 280 285 
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